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(54) Title: A STRUCTURED PEPTIDE SCAFFOLD FOR DISPLAYING TURN LIBRARIES ON PHAGE 

(57) Abstract: The invention is directed to a model system for structure-activity analysis of peptide or protein molecules involved 
in important biological processes. Provided by the invention are combinatorial peptide libraries comprising disulfide-constrained 
cyclic peptides with sequences favorable foreneigy stabilized conformations. One aspect of the invention is directed to cyclic peptide 
scaffolds that present p-hairpin structure in solution. Methods of selecting and using such peptide scaffolds are provided herein, 
which are useful for mimicking in vivo molecular interactions and designing therapeutic agents. Thus, the invention has profound 
utility for biological studies and drug development. 
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A STRUCTURED PEPTIDE SCAFFOLD FOR DISPLAYING TURN 
LIBRARIES ON PHAGE 

Technical Field 

The present invention relates in general to protein structure-activity 
5 relationship studies, and in particular to combinatorial libraries of conformationally- 
constrained peptides and methods of generating and screening such libraries for 
biological and pharmaceutical use. 



Background Art 

Structure-Activity Relationship (SAR) study provides valuable insights for 
10 understanding intermolecular interactions between a protein or peptide and other 
biologically active molecules. In their natural environment, peptides or proteins 
adopt unique, conformationally-constrained structures in order to recognize and bind 
to their binding partners, and to form a molecular complex therewith, which in turn 
elicit particular activities. Examples of protein-protein binding partners include 
15 enzyme-substrate, ligand-receptor, and antigen-antibody. Determination of the 
conformation of a peptide in its native form, therefore, become crucial for closely 
mimicking its in vivo activity and rationally designing its analogues which may be 
useful as drugs. 

Most small peptides are highly flexible and do not typically adopt unique 
20 solution conformations; in particular, they do not maintain the structure that the 
same sequence adopts in the native protein. The lack of fixed structure reduces the 
affinity the peptide might have for a target (for entropic reasons) and makes 
determination of the active conformation of the molecule extremely difficult. 
Because of this, many strategies have been described to introduce constraints into 
25 peptides (such as D-amino acids, disulfide or other crosslinks), or to replace parts of 
the peptide with more rigid non-peptide scaffolds. Indeed, such peptidomimetics 
have been widely used to perform structure-activity studies in a systematic way to 
provide information about the specific amino acid residues or fiinctional groups in a 
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peptide that are adaptable to a particular conformation and are important to 
biological activities. 

Several constrained protein scaffolds, capable of presenting a protein of 
interest as a conformationally-restricted domain have been identified, including 

5 minibody structures (Bianchi et al. (1994) J Mol Biol 236:649-659), loops on p- 
sheet tums, coiled-coil stem stmctures (Myszka & Chaiken (1994) Biochem 
33:2363-2372), zinc-finger domains, cysteine-linked (disulfide) structures, 
transglutaminase linked structures, cyclic peptides, helical barrels or bundles, 
leucine zipper motifs (Martin et al (1994) EMBO J 13:5303-5309), and etc. Of the 

10 identified protein scaffolds, P-tums have been implicated as an important site for 
molecular recognition in many biologically active peptides. Smith & Pease (1980) 
CRC Crit Rev Biochem 8:315-399. Consequently, peptides containing 
conformationally constrained P-tums are particularly desirable. The great majority 
of the identified P-tum bearing peptides are cyclopeptides which have been 

15 generated by the cyclization of a peptide similar to a sequence in the natural 
substrate. Milner- White (1989) Trends Pharmacol Sci 10:70-74. These 
cyclopeptides, however, may still retain significant flexibility. For this reason, many 
studies have attempted to introduce rigid, nonpeptide compounds which mimic the 
p-tum. Peptides with such nonpeptide P-tum mimic provide usefiil leads for drug 

20 discovery. Ball & Alewood (1 990) J Mol Recog 3:55-64; WO 94/03494 (Kahn). 

One of the revolutionary advances in drug discovery is the development of 
combinatorial libraries. Combinatorial libraries are a collection of different 
molecules, such as peptides, that can be made synthetically or recombinantly. 
Combinatorial peptide libraries contain peptides in which all amino acids have been 
25 incorporated randomly into certain or all positions of the peptide sequence. Such 
libraries have been generated and used in various ways to screen for peptide 
sequences which bind effectively to target molecules and to identify such sequences. 

Many methods for generating peptide libraries have been developed and 
described. For example, members of the peptide library can be created by split- 
30 synthesis performed on a solid support such as polystyrene or polyacrylamide resin, 
as described by Lam et al (1991) Nature 354:82 and PCT publication WO 
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92/00091. Another method disclosed by Geysen et al, U.S. Pat. No. 4,833,092 
involves the synthesis of peptides in a methodical and predetermined fashion, so that 
the placement of each library member peptide gives information concerning the 
synthetic structure of that peptide. 

5 Considerable* effort has been devoted to introducing structural constraints 

into combinatorial peptide libraries so that the member peptides represent more 
closely to their native counterparts. Houston et al U.S. Pat. No. 5,824,483 describes 
a synthetic peptide library containing peptides featuring a-helical conformation and 
thus capable of forming coiled-coil dimers with each other. McBride et al (1996) J 
10 Mol Biol 259:819-827 describe a synthetic library of cyclic peptides mimicking the 
anti-tryptic loop region of an identified proteinase inhibitor. 

A complementary method for peptide library-based lead discovery is display 
of libraries on filamentous bacteriophage. This method allows the preparation of 
libraries as large as lo'^-lO*^ unique peptide members, many orders of magnitude 

15 larger than libraries that may be prepared synthetically. In addition to large library 
sizes, advantages of phage display include ease of library construction (Kunkel 
mutagenesis), coupling of the binding entity (displayed peptide) to a unique 
identifier (its DNA sequence), a selection protocol for amplifying rare binding 
clones in a pool, and the high fidelity of biosynthesis (compared to synthetic 

20 methods). Furthermore, rapid and inexpensive selection protocols are available for 
identifying those library members that bind to a target of interest. However, only 
natural peptides composed of L-amino acids may be displayed on phage, so the 
problem of defining three-dimensional structure-activity relationships is more 
difficult than it might be for a constrained peptidomimetic containing non-naturally 

25 occurring peptides or nonpeptide compounds. One possible solution to this problem 
is to use the structural constraints of a folded protein to present small variable 
peptide segments. Indeed, several small, stable proteins have been proposed as 
peptide display scaffolds. Nygren 8l Uhlen (1997) Curr. Opin. Struct Biol 7:463- 
469; Vita et al (1998) Biopolymers 47:93-100; Vita et al (1999) Proc, Natl Acad, 

30 Set USA 96:13091-13096; Smith et al (1998) J, Mol Biol 277:317-332; 
Christmann et al (1999) Protein Engng. 12:797-806. Unfortunately, it is not clear 
that protein ligands obtained by this approach could be transformed to small- 
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molecule drug leads. Epitope transfer from proteins to small peptides or to non- 
peptide small molecules remains an extremely challenging problem. Cochran (2000) 
Chem. Biol. 7:R85-R94. 

Therefore, despite extensive studies of the rules governing conformational 
5 preferences in natural peptides and the existence of several peptide library systems, 
those features necessary for structural stability of natural peptides remain poorly 
understood. In particular, there has been little systematic or quantitative assessment 
of the effect of residue substitutions and non-covalent interactions on structure. 

Disclosure of Invention 

10 The present invention provides a novel model system for assessing 

individual residue contributions to the stability of a defined peptide scaffold and for 
evaluating a series of substitutions presented in a combinatorial peptide library. The 
peptides of the invention are cyclized via disulfide bond between two cysteines 
within the peptide sequences. Amino acid substitutions at various defined residue 

15 sites influence the conformation of the cyclic peptides and their structural stabilities. 
The invention also provides methods of screening for and analyzing cyclic peptides 
with a specific secondary structure, P-tum, which provides fiirther structural 
constraints to the peptides. The subject peptide library comprising a collection of 
turn bearing cyclic peptides can be used in screening for candidate biologically 

20 active molecules through molecular binding assays. Methods for such screenings 
are also provided by the present invention. The compositions and methods of the 
invention can be used in analyzing the structure-activity relationships of peptides of 
interest, thereby providing useful information for studies of molecular interactions 
involved in particular biological processes, as well as for rational design of 

25 therapeutic agents. 

Brief Description of Drawings 

Figure 1 depicts the design of bhp, a 10-amino acid model P-hairpin peptide. 
(A) Superimposed protein structures illustrate packing between disulfides and side 
30 chains of the closest non-hydrogen-bonded residues; (B) Schematic representation of 
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the bhp model P-hairpin peptide with the side chains of the non-hydrogen-bonded 
residues 1, 3, 8 and 10 shown. X represents the varied residue selected from 19 of 
the 20 natural L-amino acids (excluding Cys). 

5 Figure 2 shows the relative hairpin stability for substitution X in the bhp 

peptide sequence. (A) Cysteine effective concentrations (Ceff) relative to 
glutathione. Error bars are for ± one standard deviation; (B) Equilibrium free energy 
differences relative to the alanine peptide. 

Figures 3A-3B show two views of the minimized mean NMR structure of 
10 disulfide-cyciized P-hairpin bhpW, (A) Non hydrogen-bonded (NHB) strand 
residues Trp3 and Leu8 are highlighted. (B) Hydrogen-bonded (HB) strand residues 
are highlighted (Thr2, Thr9, Glu4 and Lys7). 

Figures 4A-4B depict NMR analysis of CD4 peptides. (A) Overlay of the 
fingerprint region of the COSY spectra for cdl and cd2. (B) NMR structure 
15 ensemble for cd2 (20 models; two orthogonal views) shown superimposed on CD4 
residues 37-46 from the crystal structure of gpl20-bound CD4 (PDB entry IGCl). 

Figure 5 shows circular dichroism spectra of three peptide pairs of Example 

2. 

Figure 6 shows effective concentration (Cen) values for substitutions X in 
20 the peptides of Example 3. The strand substitutions X are shown at the top of the 
graph, and the central residues of the turns are indicated to the right. 

Figure 7 depicts minimized mean structures of the tryptophan analogs of 
peptides in Example 3 overlaid on the backbone atoms of residues 1-3 and 8-10 
(RMSD of 0.36 and 0.30 A for 1 with respect to 2 and 3, respectively). Peptide 1 is 
25 in grey; peptide 2 is in black; and peptide 3 is in white. For clarity, non-proline side 
chain atoms are not shown for the four turn residues. 

Figures 8A-8B show effective concentration (Ceff) values for peptides with 
hydrophobic pairs in non hydrogen-bonded (NHB) strand positions as described in 
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Example 4. Values for substitutions paired with a cross-strand leucine are shown in 
(A); those for tryptophan pairs are shown in (B). 

Figure 9 depicts a Hammett plot comparing substitution free energy 
differences between the peptides of Example 4. 

5 Figure 10 shows double mutant analysis of the stability of W3Y8 relative to 

L3L8. 

Figures 1 1 A-l 1 B illustrate relative stabilities of bhpW analogues. Free 
energy changes (293 K) are calculated from the ratio of the cysteine effective 
concentration (Ccfr) for each peptide to that of bhpW (TT pair) using the relationship 
10 -AAG = RT]n (CcfT.xx/Cefr.TT). (A) Substitutions at position 2 (X2 series). (B) 
Substitutions at position 9 (X9 series). 

Figure 12 illustrates relative stabilities of bhp hairpins with valine at 
position 2 or 9. Solid bars indicate substitutions in the background of bhp W, while 
15 hatched bars correspond to peptides with the altered turn sequence KGNE. Free 
energy changes (293 K) are calculated from the ratio of peptide Ceirto that of the 
appropriate TT analogue (-AAG = RT In (Cefr.xx/Ccfr.rr)). 

Figure 13 shows correlation of position 3 substitution free energy 
20 differences for TT bhp analogues and for the HV, X3 series. The residue substituted 
at position 3 is indicated, and the slope of the plot is 1 .96 {R = 0.95). 

Figure 14 depicts position-specific strand twists (vf// + (t)/+i) for the 20 
structures in the ensembles determined for bhpW (circles), VH (triangles), and HV 
25 (diamonds). 

Figures 15A-15C depict sequences of BR3 variants and structure of 
bhpBR3. (A) Amino acid sequences of BR3 variants used in this study. (B and C) 
Three-dimensional structure of bhpBR3 determined by NMR spectroscopy. The 
30 backbone atoms of 20 models are shown superposed with residue labels positioned 
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in the direction of the side chain (B); one representative structure highlighting the 
BR3 turn residues (C) in the same orientation as in B. 

Figures 16A-16B depict binding of BR3 variants to BAFF. (A) Competitive 
5 displacement of biotinylated miniBR3 measured by ELISA (see methods). Data are 
shown for BR3 extracellular domain (filled triangles), miniBR3 (open squares), and 
bhpBR3 (filled circles). IC50 values from the fitted curves are 70 nM, 65 nM, and 15 
^M, respectively. (B) HeLa cells expressing a chimeric receptor composed of the 
extracellular ligand-binding domain of BR3 fiised to the death domain of DR4 were 
10 seeded 16 hr before treatment. BAFF (2 nM) was added to cells alone or after being 
preincubated with miniBR3 (3 jiM), bhpBR3 (100 nM) or a control hairpin peptide 
with an unrelated turn sequence in the same bhp scaffold (bhpC, 100 jiM) for 30 min 
at room temperature. Apoptosis was assessed 24 hr later. The turn sequences of 
bhpBR3 and bhpC are shown above the corresponding bars. 

15 Mode(s) for Carrvine Out the Invention 

L Definitions 

The term "p-tum" refers to a protein secondary structure consisting of a 
tetrapeptide sequence which causes the peptide chain to reverse direction, and which 
often contains a 4' to T hydrogen bond, forming a pseudo 10-membered ring. The 

20 most widely accepted classification of the different conformations of the p-tum is 
described in Chou and Fasman (1977) J Mol Biol 115:135-175, the disclosure of 
which is expressly incorporated by reference herein. Various P-tum types have been 
defined, including for example, type I, I', II, and IF. For the purpose of this 
invention, the term "reverse-turn" is used in a general sense to encompass well 

25 known protein secondary structures including p-tums, y-tums, p-hairpins and p- 
bulges. 

"Cell," "cell line," and "cell culture" are used interchangeably herein and 
such designations include all progeny of a cell or cell line. Thus, for example, terms 
like "transformants" and "transformed cells" include die primary subject cell and 
30 cultures derived therefrom without regard for the number of transfers. It is also 
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understood that all progeny may not be precisely identical in DNA content, due to 
deliberate or inadvertent mutations. Mutant progeny that have the same function or 
biological activity as screened for in the originally transformed cell are included. 
Where distinct designations are intended, it will be clear from the context. 

5 The terms "competent cells" and "electoporation competent cells" mean 

cells which are in a state of competence and able to take up DNAs from a variety of 
sources. The state may be transient or permanent. Electroporation competent cells 
are able to take up DNA during electroporation. 

"Control sequences" when referring to expression means DNA sequences 
10 necessary for the expression of an operably linked coding sequence in a particular 
host organism. The control sequences that are suitable for prokaryotes, for example, 
include a promoter, optionally an operator sequence, a ribosome binding site, and 
possibly, other as yet poorly understood sequences. Eukaryotic cells are known to 
utilize promoters, polyadenylation signals, and enhancers. 

15 The term "coat protein" means a protein, at least a portion of which is present 

on the surface of the virus particle. From a functional perspective, a coat protein is 
any protein which associates with a virus particle during the viral assembly process 
in a host cell, and remains associated with the assembled virus until it infects another 
host cell. The coat protein may be the major coat protein or may be a minor coat 

20 protein. A "major" coat protein is a coat protein which is present in the viral coat at 
10 copies of the protein or more. A major coat protein may be present in tens, 
hundreds or even thousands of copies per virion. 

The terms "electroporation" and "electroporating" mean a process in which 
foreign matter (protein, nucleic acid, etc.) is introduced into a cell by applying a 
25 voltage to the cell under conditions sufBcient to allow uptake of the foreign matter 
into the cell. The foreign matter is typically DNA. 

A "fusion protein" is a polypeptide having two portions covalently linked 
together, where each of the portions is a polypeptide having a different property. 
The property may be a biological property, such as activity in vitro or in vivo. The 
30 property may also be a simple chemical or physical property, such as binding to a 
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target molecule, catalysis of a reaction, etc. The two portions may be linked directly 
by a single peptide bond or through a peptide linker containing one or more amino 
acid residues. Generally, the two portions and the linker will be in reading frame 
with each other. 

5 "Heterologous DNA" is any DNA that is introduced into a host cell. The 

DNA may be derived from a variety of sources including genomic DNA, cDNA, 
synthetic DNA and fusions or combinations of these. The DNA may include DNA 
from the same cell or cell type as the host or recipient cell or DNA from a different 
cell type, for example, from a mammal or plant. The DNA may, optionally, include 
10 selection genes, for example, antibiotic resistance genes, temperature resistance 
genes, etc. 

"Ligation" is the process of forming phosphodiester bonds between two 
nucleic acid fragments. For ligation of the two fragments, the ends of the fragments 
must be compatible with each other. In some cases, the ends will be directly 

15 compatible after endonuclease digestion. However, it may be necessary first to 
convert the staggered ends commonly produced after endonuclease digestion to 
blunt ends to make them compatible for ligation. For blunting the ends, the DNA 
may be treated in a suitable buffer for at least 15 minutes at 15®C with about 10 
units of the Klenow fragment of DNA polymerase I or T4 DNA polymerase in the 

20 presence of the four deoxyribonucleotide triphosphates. The DNA may then 
purified by phenol-chloroform extraction and ethanol precipitation. The DNA 
fragments that are to be ligated together are put in solution in about equimolar 
amounts. The solution will generally also contain ATP, ligase buffer, and a ligase 
such as T4 DNA ligase at about 10 units per 0.5 [ig of DNA. If the DNA is to be 

25 ligated into a vector, the vector is first linearized by digestion with the appropriate 
restriction endonuclease(s). The linearized fragment is then treated with bacterial 
alkaline phosphatase or calf intestinal phosphatase to prevent self-ligation during the 
ligation step. 

A "mutation" is a deletion, insertion, or substitution of a nucleotide(s) 
30 relative to a reference nucleotide sequence, such as a wild type sequence. 
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"'Operably linked" when referring to nucleic acids means that the nucleic 
acids are placed in a functional relationship with another nucleic acid sequence. For 
example, DNA for a presequence or secretory leader is operably linked to DNA for a 
polypeptide if it is expressed as a preprotein that participates in the secretion of the 
5 polypeptide; a promoter or enhancer is operably linked to a coding sequence if it 
affects the transcription of the sequence; or a ribosome binding site is operably 
linked to a coding sequence if it is positioned so as to facilitate translation. 
Generally, "operably linked" means that the DNA sequences being linked are 
contiguous and, in the case of a secretory leader, contiguous and in reading phase. 
10 However, enhancers do not have to be contiguous. Linking is accomplished by 
ligation at convenient restriction sites. If such sites do not exist, the synthetic 
oligonucleotide ad^ters or linkers are used in accord with conventional practice. 

"Phage display" is a technique by which variant polypeptides are displayed 
as fusion proteins to a coat protein on the surface of phage, e.g. filamentous phage 

15 particles. A utility of phage display lies in the fact that large libraries of randomized 
protein variants can be rapidly and efficiently sorted for those sequences that bind to 
a target molecule with high affinity. Display of peptides and proteins libraries on 
phage has been used for screening millions of polypeptides for ones with specific 
binding properties. Polyvalent phage display methods have been used for displaying 

20 small random peptides and small proteins through fusions to either gene in or gene 
VIII of filamentous phage. Wells and Lowman (1992) Curr. Opin, Biotech. 3:355- 
362 and references cited therein: In monovalent phage display, a protein or peptide 
library is fused to a gene III or a portion thereof and expressed at low levels in the 
presence of wild type gene HI protein so that phage particles display one copy or 

25 none of the fusion proteins. Avidity effects are reduced relative to polyvalent phage 
so that sorting is on the basis of intrinsic ligand affinity, and phagemid vectors are 
used, which simplify DNA manipulations. Lowman and Wells (1991) Methods: A 
companion to Methods in Enzymology 3:205-216. In phage display, the phenotype 
of the phage particle, including the displayed polypeptide, corresponds to the 

30 genotype inside the phage particle, the DNA enclosed by the phage coat proteins. 

A "phagemid" is a plasmid vector having a bacterial origin of replication, 
e.g., ColEl, and a copy of an intergenic region of a bacteriophage. The phagemid 
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may be based on any known bacteriophage, including filamentous bacteriophage. 
The plasmid will also generally contain a selectable marker for antibiotic resistance. 
Segments of DNA cloned into these vectors can be propagated as plasmids. When 
cells harboring these vectors are provided with all genes necessary for the 
5 production of phage particles, the mode of replication of the plasmid changes to 
rolling circle replication to generate copies of one strand of the plasmid DNA and 
package phage particles. The phagemid may form infectious or non-infectious 
phage particles. This term includes phagemids which contain a phage coat protein 
gene or fragment thereof linked to a heterologous polypeptide gene as a gene fusion 
10 such that the heterologous polypeptide is displayed on the surface of the phage 
particle. Sambrook et aL 4A 7. 

The term "phage vector" means a double stranded replicative form of a 
bacteriophage containing a heterologous gene and capable of replication. The phage 
vector has a phage origin of replication allowing phage replication and phage 
15 particle formation. The phage is preferably a filamentous bacteriophage, such as an 
M13, fl, fd, Pf3 phage or a derivative thereof, a lambdoid phage, such as lambda, 
21, phiSO, phiSl, 82, 424, 434, etc., or a derivative thereof, a Baculovirus or a 
derivative thereof, a T4 phage or a derivative thereof , a T7 phage virus or a 
derivative thereof. 

20 "Preparation" of DNA from cells means isolating the plasmid DNA from a 

culture of the host cells. Commonly used methods for DNA preparation are the 
large- and small-scale plasmid preparations described in sections 1.25-1.33 of 
Sambrook et aL After preparation of the DNA, it can be purified by methods well 
known in the art such as that described in section 1 .40 of Sambrook et aL 

25 "Oligonucleotides'* are short-length, single- or double-stranded 

polydeoxynucleotides that are chemically synthesized by known methods (such as 
phosphotriester, phosphite, or phosphoramidite chemistry, using solid-phase 
techniques such as described in EP 266,032 published 4 May 1988, or via 
deoxynucleoside H-phosphonate intermediates as described by Froehler et aL (1986) 

30 NucL Acids Res., 14:5399-5407). Further methods include the polymerase chain 
reaction defined below and other autoprimer methods and oligonucleotide syntheses 
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on solid supports. All of these methods are described in Engels et al (1989) Agnew. 
Chem. Int. Ed Engl 28:716-734 . These methods are used if the entire nucleic acid 
sequence of the gene is known, or the sequence of the nucleic acid complementary 
to the coding strand is available. Alternatively, if the target amino acid sequence is 
5 known, one may infer potential nucleic acid sequences using known and preferred 
coding residues for each amino acid residue. The oligonucleotides are then purified 
on poiyacrylamide gels. 

"Polymerase chain reaction" or "PGR" refers to a procedure or technique in 
which minute amounts of a specific piece of nucleic acid, RNA and/or DNA, are 

10 ampHfied as described in U.S. Patent No. 4,683,195 issued 28 July 1987. Generally, 
sequence information fi"om the ends of the region of interest or beyond needs to be 
available, such that oligonucleotide primers can be designed; these primers will be 
identical or similar in sequence to opposite strands of the template to be amplified. 
The 5' temiinal nucleotides of the two primers may coincide with the ends of the 

15 amplified material. PGR can be used to amplify specific RNA sequences, specific 
DNA sequences fi-om total genomic DNA, and cDNA transcribed fi^om total cellular 
RNA, bacteriophage or plasmid sequences, etc. See generally Mullis et al. (1987) 
Cold Spring Harbor Symp. Quant, Biol 51:263 ; Erlich, ed., PCR Technology, 
(Stockton Press, NY, 1989). As used herein, PGR is considered to be one, but not 

20 the only, example of a nucleic acid polymerase reaction method for amplifying a 
nucleic acid test sample comprising the use of a known nucleic acid as a primer and 
a nucleic acid polymerase to amplify or generate a specific piece of nucleic acid. 

DNA is "purified" when the DNA is separated fi-om non-nucleic acid 
impurities. The impurities may be polar, non-polar, ionic, etc. 

25 "Recovery" or "isolation" of a given fragment of DNA fi-om a restriction 

digest means separation of the digest on poiyacrylamide or agarose gel by 
electrophoresis, identification of the fi-agment of interest by comparison of its 
mobility versus that of marker DNA fragments of known molecular weight, removal 
of the gel section containing the desired fragment, and separation of the gel from 

30 DNA. This procedure is known generally. For example, see Lawn et al (1981) 
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Nucleic Acids Res., 9:6103-6114, and Goeddel et al (1980) Nucleic Acids Res., 
8:4057. 

A "transcription regulatory element" will contain one or more of the 
following components: an enhancer element, a promoter, an operator sequence, a 
5 repressor gene, and a transcription termination sequence. These components are 
well known in the art. U.S. 5,667,780. 

A "transformant" is a cell which has taken up and maintained DNA as 
evidenced by the expression of a phenotype associated with the DNA (e.g., 
antibiotic resistance conferred by a protein encoded by the DNA). 

10 "Transformation" or "transforming" means a process whereby a cell takes up 

DNA and becomes a "transformant". The DNA uptake may be permanent or 
transient. 

A "variant" or "mutant" of a starting polypeptide, such as a fusion protein or 
a heterologous polypeptide (heterologous to a phage), is a polypeptide that 1) has an 

15 amino acid sequence different from that of the starting polypeptide and 2) was 
derived from the starting polypeptide through either natural or artificial (manmade) 
mutagenesis. Such variants include, for example, deletions from, and/or insertions 
into and/or substitutions of, residues within the amino acid sequence of the 
polypeptide of interest. Any combination of deletion, insertion, and substitution 

20 may be made to arrive at the final variant or mutant construct, provided that the final 
construct possesses the desired fiinctional characteristics. The amino acid changes 
also may alter post-translational processes of the polypeptide, such as changing the 
number or position of glycosylation sites. Methods for generating amino acid 
sequence variants of polypeptides are described in U. S. 5,534,615, expressly 

25 incorporated herein by reference. 

The term "peptide analog" refers to a molecule or part thereof which is 
comprised of amino acids and resembles, with regard to its binding ability and/or 
specificity, a specific molecule, as defined above. Such peptide analogs may be 
found or constructed by protein engineering techniques, such methods being well 
30 known to those of skill in the art. Alternatively, such peptide analogs may be foimd 
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by a screening process, for example wherein a natural binding partner of the specific 
molecule (which specific molecule is not necessarily a protein or peptide), or a 
portion thereof, is used as described herein (i.e. in a chimeric protein) to screen 
peptide compounds for the ability to bind to it. In a second screening step, the newly 
5 found peptide compound (or a portion thereof) may itself be used as a peptide 
analog of the specific molecule in a chimeric protein to screen for analogs of the 
natural binding partner. Other methods for finding or making peptide analogs will be 
apparent to those of skill in the art. 

The term "epitope" means an antigen or portion thereof which is capable of 
10 binding with an antibody as an antigenic determinant. 

By "binding partner complex" is meant the association of two or more 
molecules which are bound to each other in a specific, detectable manner; thus the 
association of ligand and receptor, antibody and antigen, and chimeric protein and 
the compound to which it binds. 

15 The term "directly or indirectly labeled" refers to a molecule may contain a 

label moiety which moiety emits a signal which is capable of being detected, such as 
a radioisotope, a dye, or a fluorescent or chemiluminescent moiety, or may contain a 
moiety, such as an attached enzyme, ligand such as biotin, enzyme substrate, 
epitope, or nucleotide sequence which is not itself detected but which, through some 

20 additional reaction, is capable of indicating the presence of the compound. 

By "ligand" is meant a molecule or a muhimeric molecular complex which is 
able to specifically bind another given molecule or molecular complex. Often, 
though not necessarily, a ligand is soluble while its target is immobilized, such as by 
an anchor domain imbedded into a cell membrane. 

25 The term "receptor" refers to at least a portion of a molecule, or a multimeric 

molecular complex which has an anchor domain embedded into a cell membrane 
and is able to bind a given molecule or molecular complex. Many receptors have 
particularly high affinity for a ligand when either or both the receptor or ligand are 
in a homo- or hetero multimeric form, such as a dimer. 
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The tenn "solid support" refers to an insoluble matrix either biological in 
nature, such as, without limitation, a cell or bacteriophage particle, or synthetic, such 
as, without limitation, an acrylamide derivative, cellulose, nylon, silica, and 
magnetized particles, to which soluble molecules may be linked or joined. 

5 By "naturally-occuring" is meant normally found in nature. Although a 

chemical entity may be naturally occurring in general, it need not be made or 
derived from natural sources in any specific instance. 

By "non naturally-occurring" is meant rarely or never found in nature and/or 
made using organic synthetic methods. 

10 "Modified" means non naturally-occuring or altered in a way that deveates 

from naturally-occurring compounds. 

IL Genera] 

The present invention is directed to conformationally-constrained peptides 
and peptide libraries that are usefril for structure-activity analysis of bioactive 

15 molecules and for drug lead discovery. The peptide of the invention comprises two 
Cysteine residues that are capable of forming disulfide bond with each other. Thus, 
the peptide adopts a cyclic form in solution, which facilitates the formation of a 
hairpin scaffold. Disulfide cyclization is helpful, although not sufficient to constrain 
the structure of many peptides. The rest of the residues of the peptide are fiirther 

20 selected to be significantly biased toward the formation of the hairpin structure. 
Moreover, a subset of the residues within the peptide of the invention is varied to 
provide relative diversity for mimicking various bioactive peptides having a 
identified secondary structure, such as P-tum, which has been proven significant in 
biological processes. 

25 In one aspect, the invention encompasses a peptide library comprising a 

collection of structurally-constrained cyclic peptides. Each peptide member of the 
library comprises amino acid sequence Cl-Al-A2-(A3)n-A4-A5-C2 [SEQ ID 
N0:1], wherein 

Al, A2, A3, A4, and A5 are naturally occurring L-amino acids; 
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the terminus of Cysteine CI is optionally protected with an amino protecting 

group; 

the terminus of Cysteine C2 is optionally protected with a carboxy protecting 

group; 

5 Al and A5 are selected from the group consisting of amino acids W, Y, F, H, 

I,VandT; 

A2 and A4 are selected from the group consisting of amino acids W, Y, F, L, 
M, I, and V; 

A3 is any naturally occurring L-amino acid and n is an integer that is 
10 selected from the group consisting of 3, 4, 5, 6, 7, 8, 9, 10, 1 1 and 12; and 

CI and C2 are joined together by a disulfide bond thereby forming a cyclic 
peptide. 

In one preferred embodiment, the peptides of the invention have a [5- 
branched residue having two non-hydrogen substituents on the P-carbon of the 
15 amino acid residue at position Al or A5 or both. Preferred peptides of the invention 
have a branched aliphatic residue I, V or T at Al, A5 or both. More preferably, Al 
or A5 is threonine (T). Even more preferably, both Al and A5 are threonine 
residues. 

According to another preferred embodiment, the peptides have an aromatic 
20 residue W, Y, F or H at position Al or A5 or both. More preferably, Al or A5 is W. 
One preferred peptide contains H at Al and V at A5. 

In another preferred embodiment, the peptides of the invention have an 
aromatic residue W, Y or F at position A2 or A4 or both. More preferably, A2 or 
A4 is W; and even more preferably, A2 and A4 are Ws. Another preferred 
25 embodiment include peptides having an unbranched aliphatic residue L or M at 
position A2 or A4 or both; more preferably A2 or A4 is Leucine. Still other 
preferred peptides have a branched aliphatic residue I or V at position A2 or A4 or 
both. 
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In the peptides of the invention, the niunber of the A3 residues n can be 3, 4, 
5, 6. 7, 8, 9, 10, 1 1, or 12; preferably 4. 5, 6, 7, 8, 9, or 10; and more preferably 4, 5 
or 6. In one embodiment, n is 4 and the resulting peptides are decamers. In these 
decamers, the residue sites Al, A2, A4 and AS are each from a selected group of 
5 amino acid residues as described above, whereas the middle (A3)4 is a tetrapeptide 
sequence with varying amino acids. In one aspect of the invention, the (A3)4 
tetrapeptide sequence is selected from those favorable to forming a P-tum structure, 
including but not limited to EGNK, ENGK, QGSF, VWQL and GPLT. 

In one aspect, the library of the instant invention contains at least about 10^ 
10 member peptides, each of which has at least one amino acid variation from others. 
Preferably, the library contains at least about 10^ peptides, more preferably about 
10*° peptides and even more preferably at least about 10^^ peptides. According to 
various embodiments, the amino acid variation occurs at defined positions within the 
peptides. For example, variations can occur at hydrogen-bonded (HB) strand sites 
15 (e.g., A1/A5) or non hydrogen-bonded (NHB) strand sites (e.g., A2/A4); a residue 
and its cross-strand counterpart (e.g., A1/A5 or A2/A4) can have same or different 
amino acids. Variations can also occur at the middle (A3)n sites, wherein A3 can be 
any of the 20 naturally occurring L-amino acids. 

The carboxy terminal end and the amino terminal end of the cyclic peptide 
20 may be protected with any known protecting groups or may be bonded to other 
amino acid residues (generally naturally occurring residues), both in the (L) and in 
the (D) form through conventional amide peptide bonds. The protecting groups and 
additional residues can be added using conventional peptide synthesis techniques. 
Generally from 1 to about 50, preferably from 1 to about 20, amino acid residues 
25 may be present on each of the carboxy and amino terminal positions, independently. 
These additional residues may be part of a known protein containing a beta turn of 
interest or may be any other desired sequence of residues. These additional residues 
may be added to determine the effect of the beta turn structure on the structure of the 
overall polypeptide or to determine the effect of the additional residues on the 
30 binding of the beta turn cyclic peptide with a protein of interest. 
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Alternatively, a library of cyclic peptides of the invention can be prepared in 
which one or more of residues Al, A2, A4, and/or A5 are independently fixed and 
residues A3 are varied using known methods of generating peptide libraries. A 
preferred method of generating a library is phage display. Any known method of 
5 phage display, such as those discussed in more detail below, may be used in the 
method of the invention. 

In another aspect, the invention encompasses a cyclic peptide scaffold for 
presenting a p-tum hairpin structure. The cyclic peptide scaffold comprises the 
amino acid sequence C1-A1-A2-(A3)„-A4-A5-C2 (SEQ ID N0:1), wherein CI and 

10 C2 are cysteines; Al, A2, A3, A4, and A5 are naturally occurring L-amino acids; the 
terminus of Cysteine CI is optionally protected with an amino protecting group; the 
terminus of Cysteine C2 is optionally protected with a carboxy protectmg group; Al 
and A5 are selected from the group consisting of amino acids W, Y, F, H, I, V and 
T; A2 and A4 are selected from the group consisting of amino acids W, Y, F, L, M, 

15 1, and V; A3 is any naturally occurring L-amino acid and n is an integer that is 
selected from the group consisting of 3, 4, 5, 6, 7, 8, 9, 10, 11 and 12; and CI and 
C2 are joined together by a disulfide bond thereby forming a cyclic peptide. 

In one preferred embodiment, the peptide scaffolds of the invention have a p- 
branched residue having two non-hydrogen substituents on the p-carbon of the 
20 amino acid residue at position Al or A5 or both. Preferred peptide scaffolds of the 
invention have a branched aliphatic residue I, V or T at Al, A5 or both. More 
preferably, Al or A5 is threonine (T). Even more preferably, both Al and A5 are 
threonine residues. 

According to another embodiment, the peptide scaffolds have an aromatic 
25 residue W, Y, F or H at position Al or A5 or both. More preferably, Al or A5 is W. 
One preferred peptide scaffold of the invention has H at Al and V at A5. 

In another preferred embodiment, the peptide scaffolds of the invention have 
an aromatic residue W, Y or F at position A2 or A4 or both. More preferably, A2 or 
A4 is W; and even more preferably, A2 and A4 are Ws. Another preferred 
30 embodiment include peptide scaffolds having an unbranched aliphatic residue L or 
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M at position A2 or A4 or both; more preferably A2 or A4 is Leucine. Still other 
preferred peptides have a branched aliphatic residue I or V at position A2 or A4 or 
both. 

In the peptides of the invention, the number of the A3 residues n can be 3, 4, 
5 5, 6, 7, 8, 9, 10, 11, or 12; preferably 4, 5, 6, 7, 8, 9, or 10; and more preferably 4, 5 
or 6. In one embodiment, n is 4 and the resulting peptides are decamers. In these 
decamers, the residue sites Al, A2, A4 and AS are each from a selected group of 
amino acid residues as described above, whereas the middle (A3)4 is a tetrapeptide 
sequence with varying amino acids. In one aspect of the invention, the (A3)4 
10 tetrapeptide sequence is selected from those favorable to forming a P-tum structure, 
including but not limited to EGNK, ENGK, QGSF, VWQL and GPLT. 

The carboxy terminal end and the amino terminal end of the cyclic peptide 
scaffold may be protected with any known protecting groups or may be bonded to 
other amino acid residues (generally naturally occurring residues), both in the (L) 

15 and in the (D) form through conventional amide peptide bonds. The protecting 
groups and additional residues can be added using conventional peptide synthesis 
techniques. Generally from 1 to about SO, preferably from 1 to about 20, amino acid 
residues may be present on each of the carboxy and amino terminal positions, 
independently. These additional residues may be part of a known protein containing 

20 a beta turn of interest or may be any other desired sequence of residues. These 
additional residues may be added to determine the effect of the beta turn structure on 
the structure of the overall polypeptide or to determine the effect of the additional 
residues on the binding of the beta turn cyclic peptide with a protein of interest. 

The present invention also encompasses methods of screening for peptides 
25 having a p-hairpin scaffold that is conformationally stabilized, comprising the steps 
of a) providing a combinatorial library of the invention as described above; b) 
selecting at least two peptides from the combinatorial library, wherein said at least 
two peptides differ by one amino acid at a particular position Al, A2, A3, A4 or A5; 
c) determining the conformations of the peptides; d) measuring and comparing the 
30 relative stabilities of the peptides; and e) selecting the peptide having a 
conformationally stabilized P-hairpin scaffold. The conformation and stability of 
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the peptides can be determined using many methods known in the art such as NMR, 
molecular modeling, crystallography and free energy calculation. See, for example, 
Cavanagh et ai (1995) Protein NMR Spectroscopy, Principles and Practices 
(Academic Press, San Diego). Particular methods of determining peptide 
5 conformation and stability are described in more detail below by way of examples. 
The p-tum containing peptides of the invention can be useful for mimicking native 
bioactive proteins in their binding activities. 

The identity of the P-tum residues A3 may be determined by studying known 
protein structures and then substituting the known structural sequence into the 

10 structured beta hairpin compound of the invention. In this embodiment, residues A3 
are taken from the known protein whereas residues Al, A2, A4 and AS are as 
described for the invention. In this way, the fixed residues of the invention can be 
used to structure particular turns from proteins of interest, allowing one to test 
whether the protein turn is sufficient for binding to a known protein binding partner, 

15 or for antagonizing the relevant protein-protein interaction. 

The invention also includes methods of identifying a peptide capable of 
binding a specific binding partner, comprising the steps of a) providing a 
combinatorial library as described above; b) contacting the combinatorial library 
with a binding partner; c) selecting from the library peptides capable of forming a 
20 noncovalent complex with the binding partner; and d) optionally isolating said 
peptides of step. Methods and technologies for assessing peptide binding activity 
and isolating peptides of interest are known in the art and described in more detail 
below. 

Binding partners of the peptides of the invention can be at least a portion of 
25 any molecules, including any known or unknown peptides, proteins, other 
macromolecules or chemical compounds that are capable of binding to the peptides 
and optionally exerting bioactivities. Protein molecules such as receptors, ligands, 
antigens, antibodies, enzymes and enzyme substrates and fragments or portions 
thereof are encompassed by "binding partners." Other non-protein chemical 
30 compounds, organic or inorganic, can also be the binding partners of the peptides. 
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III. p-bairpin Peptides 

One embodiment of the present invention involves a cyclic peptide scaffold 
that adopt p-hairpin conformations in solution. The component parts of p-hairpin 
structure include paired antiparallel p-strands and, preferably, p-tums. The 
5 preferential placement of disulfide-bonded cysteine pairs at non-hydrogen bonded 
sites in the p-strands has been studied, as has been specific pairs of cross-strand 
residues that are statistically favored (in either hydrogen bonded or non-hydrogen 
bonded sites), at least in proteins. One study describes experimental stability 
measurements of mutant proteins in which various pairs of residues have been 

10 introduced into hydrogen bonded sites on adjacent antiparallel strands. Smith & 
Regan (1995) Science 270: 980-982. Attempts have been made to determine 
intrinsic preferences for individual amino acids to adopt conformations suited to the 
geometry of a P-strand, either by analyzing the residue content of P-strands (Chou & 
Passman (1978) Annu. Rev, Biochem, 47:251-276) or by substituting various amino 

15 acids into a P-strand of a protein and measuring the relative stabilities of the mutants 
(Kim & Berg (1993) Nature 362:267-270; Minor & Kim (1994) Nature 367:660- 
663; Minor & Kim (1993) /Va/wre 371:264-267; SvmXh et ai Biochemistry i\99A) 
33:5510-5517). A revised statistical method for assigning residue conformations 
has improved correlation with the various experimental propensity scales (Munoz & 

20 Serrano (1994) Proteins 20:301-311). The propensity assigned to tryptophan is 
moderate in all reported scales. 

It has recently been shown that for some short, linear peptides (4-16 amino 
acids), the hairpin conformation is partially populated in aqueous solution. Both 
designed peptides and peptides taken from protein sequences have exhibited this 
25 behavior. In general, these studies involve peptides with statistically strong turn 
sequences (e.g., asn-gly at i+1, i+2). Nevertheless, hairpin populations seldom 
exceed 40-50% in aqueous solution. 

A 16-mer peptide derived from the protein ubiquitin but with a statistically 
more common turn sequence (MQIGVKNPDGTITLEV) did form a highly 
30 populated hairpin in water (ca. 80%), but the hairpin did not have the same strand 
register as in the native protein (Searle et al (1995) Nat Struct. Biol 2:999-1006). 
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Another group studied a similar peptide in which the turn region was replaced with 
several sequences (MQIGVKSXXKTITLKV, wherein XX = pro-ala or pro-gly; 
Haque & Gellman (1997) 7. Am. Chem, Soc. 119:2303-2304). Evidence for the 
hairpin structure, with native strand register, was observed for turns containing D- 
5 amino acids but not for L-amino acid sequences. No population estimates were 
given in this study. 

Several groups have studied model peptides based originally on a sequence 
from the protein tendamistat. The peptide YQNPDGSQA shows NMR evidence of a 
small population of hairpin in water (Blanco et al (1993) 7. Am. Chem, Soc. 

10 115:5887-5888; de Alba et al (1995) Eur. J. Biochem. 233:283-292; Constantine et 
al. (1995) J. Am. Chem. Soc. 117:10841-10854; Friedrichs et al J. Am. Chem. Soc. 
(1995) V 117, pp 10855-10864). A variant of this peptide with strand residues of 
higher expected P-propensity (lYSNPDGTWT) was compared to a second peptide 
with a different turn sequence (lYSNSDGTWT). Both peptides were estimated by 

15 NMR as 30% hairpin in water (de Alba et al (1996) Fold. Des, 1:133-144). Further 
variation of this peptide, predominantly in the turn sequence, yielded hairpins of 
various structures and mixed populations. Generally no one conformer population 
exceeded 50% (de Alba et al (1997) / Am. Chem. Soc. 119:175-183). In a final 
study, the three N-terminal residues in peptide ITSNSDGTWT were replaced with 

20 various sequences. Again, mixed conformers were frequently observed and 
populations of a given hairpin conformer were generally less than 50%: one peptide 
(YITNSDGTWT) did form a register-shifted hairpin that was highly populated 
(80%; de Alba et al (1997) Protein ScL 6:2548-2560). The authors of these studies 
conclude that conformational preferences of the turn residues dominate cross-strand 

25 interactions in determining the stability of hairpins, at least in these short model 
peptides. 

Analysis of hairpin sequences in crystal structures has allowed the design of 
a different series of P-hairpin peptides. The target structure was a type I' turn 
flanked by three-residue strands. Arg-gly sequences were added to the ends to 
30 improve solubility. The peptide RGITVNGKTYGR is partially folded into a hairpin 
conformation (about 30%) as determined by NMR (Ramirez- Alvarado et al (1996) 
Nat. Struct. Biol 3:604-612). The importance of strand residues is indicated by 
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rqjlacement of the ile and val, the lys and tyr, or all four residues with alanine. 
None of the alanine-substituted peptides showed any tendency to fonn a hairpin. 
The same authors reported a second series of experiments in which position i+1 of 
the turn was varied (asn to asp, ala, gly or ser). No peptide was more structured than 
5 the original sequence with asn in the turn (Ramirez- Alvarado et al. (1997) J. MoL 
Biol, 273:898-912). A review describing this work stated that adding glu-lys pairs 
to the termini of the model peptide stabilized the hairpin but did not give further 
details (Ramirez-Alvarado et ai (1999) Bioorg. Med. Chem. 7:93-103). 

Another model peptide series (RYVEVXGOmKILQ) has yielded evidence 
10 for hairpin formation in water. Residue X as D-pro or L-asn yields characteristic 
NOEs and alpha-H shifts, but the L-pro peptide is unfolded. No population 
estimates are given, but D-pro appears to give the more stable hairpin (Stanger & 
Gelhnan (1998) /. Am. Chem. Soc. 120:4236-4237). A later study used a disulfide- 
cyclized and back bone-cyclized version of this peptide as a model for the fully 
15 folded hairpin, allowing the estimation of hairpin population in uncyclized analogs 
(30-70%). Syud, a (1999) Am Chem Soc. 121:11577-11578. 

A designed 16-residue peptide (KKYTVSDSTGKKITVSI) based on the met 
repressor DNA binding region forafied a hairpin structure in water with an estimated 
population of 50% at 303 K. Truncation of one strand showed that the turn was 

20 populated without the strand interactions, although to a lesser degree (35%), An 
analysis of the thermodynamic parameters for hairpin formation showed that folding 
is enthalpically unfavored and entropically driven, with AG = 0.08 kcal/mol at 298 
K (Maynard & Searle (1997) Chem. Commun. 1297-1298; Griffiths-Jones et al. 
(1998) Chem. Commun. 789-790; Maynard et al (1998) J. Am. Chem. Soc. 

25 120:1996-2007). Griffiths- Jones, et al. (1999)/. MolBioL 292:1051-1069. 

A final hairpin peptide (GEWTYDDATKTFTVTE) derived fi-om the Bl 
domain of protein G (GBl) has some features relevant to the peptides of the 
invention. Unlike the above described model hairpins, the GBl hairpin has four 
threonine residues at hydrogen-bonded sites in the strands, including one thr-thr 
30 cross-strand pair. This is generally believed to be an unfavorable pairing. Li 
addition, there are trp-val and tyr-phe pairs at adjacent nonhydrogen-bonded sites 
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that might interact to form a small hydrophobic core. The reported data indicate that 
the GBl peptide formed a well-populated hairpin (about 50%) in water. The data 
are consistent with native strand pairing (Blanco et fl/.(1994) Nat, Struct Biol 
1:584-590). A denaturation study of the GBl peptide allowed estimation of 80% 
hairpin at 273 K, and analysis of the data (assuming ACp = 0) yielded AH = -11.6 
kcal/mol, AS = -39 cal/mol K: i.e., folding is enthalpically driven and entropically 
disfavored (Munoz et al Nature (1998) v 390, pp. 196-199). The relative roles of 
enthalpy and entropy are reversed compared to the met repressor peptide described 
above. Similar results were reported from an NMR study of the thermal 
denaturation of the GBl peptide. Honda et al. (2000) X Mol BioL 295:269-278. A 
mutagenesis study from the same group (Kobayashi et al. (2000) Biochemistry 
39:6564-6571) identified the cross-strand phenylalanine and tyrosine residues as 
especially important for hairpin stability. Finally, this group reported a disulfide- 
cyclized analog of the GBl peptide (CEWTYDDATKTFTVTCK; Kobayashi et al. 
(1999) Biochemistry 38:3228-3243). The structure of this analog was not described; 
however, it bound 7-fold more tightly to the remainder of the GBl protein than did 
the uncyclized peptide. 

Several designed three-stranded sheets have been reported: one of these 
contains only the usual 20 amino acids occurring in proteins and folds in water 
20 (Kortemme et al (1998) Science 281 :253-256). One aspect of the design is addition 
of a trp at a nonhydrogen-bonded position (by analogy to WW domains) while also 
changing two nonhydrogen-bonded residues on the next strand to unbranched amino 
acids. The authors state that the branched residues would not allow the trp side 
chain to pack across to the next strand. Thermodynamic analysis of denaturation 
25 data yields a folding free energy of -0.6 kcal/mol at 278 K (estimated folded 
population = 80-90%). 

Numerous examples have been reported of disulfide-constrained peptides 
intended to mimic protein hairpins or as de novo designed hairpins. In many cases 
the designs include D-cysteines at one or both ends, as it was initially thought that 
30 disulfide bond geometry was not compatible with the cross-strand geometry of 
hairpins. However, there are some examples that do use L-cys. 
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Evidence for structure is lacking in most studies of disulfide-cyclized 
peptides. Examples listed here are those that have been experimentally determined, 
or that use no unusual amino acids and have potency close to a larger, hairpin- 
containing protein in a biological assay. 

5 The structure of a hexapeptide (Boc-CL-Aib-AVC-NMe) was deteraiined 

crystallographically, revealing a type II* turn and p-sheet geometry (Karle et al J. 
Am. Chem. Soc. (1988) v 110, pp 1958-1963). An octapeptide with the same 
cysteine spacing (ACSPGHCE) was studied by NMR, and has a similar structure 
with a turn centered on pro-gly (Walse et al (1996) J. CompuL-Aided MoL Des, 
10 10:11-22). Peptides of the form Ac-CXPGXC-NHMe were evaluated by 
measurement of disulfide exchange equilibria, which indicated turn preferences 
between peptides of as much as 1 kcal/mol (Milbum et al (1987) J. Am. Chem. Soc, 
109:4486-4496). 

An eleven-residue cyclic peptide (CGVSRQGKPYC) based on the gene 5 
15 protein from Ml 3 is stably structured in aqueous solution, as demonstrated by NMR 
analysis. The cyclic peptide adopts a structure that is quite similar to the 
corresponding protein loop. The authors claim that well-defined p-hairpin structure 
had not been previously reported for any unprotected disulfide-constrained cycle 
(Rietman et al. (1996) Eur. J. Biochem. 238:706-713). This peptide has a val-pro 
20 pair at the nonhydrogen bonded sites nearest to the cysteines. 

Cyclization of peptides corresponding to loops from Limulus anti- 
lipopolysaccharide factor (LALF) based on X-ray structure yielded potent lipid A 
binders. There is no evidence for structure in these peptides. Several of the peptides 
have aromatic-aromatic pairs at the nonhydrogen-bonded sites nearest the cysteines; 
25 however, the most potent (GCKPTFRRLKWKYKCG) has a pro-tyr pair (Ried et al 
(1996)/ Biol Chem. 271:28120-28127). 

Disulfide-cyclized peptides from the hairpin region of a rabbit defensin have 
antibacterial activity exceeding (about 5 to 10-fold) that of the linear analogs. 
Circular dichroism spectroscopy indicates some non-random structure in phosphate 
30 buffer. The more potent peptide (CAGFMRIRGRIHPLCMRR) has a gly-pro pair at 
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the nonhydrogen bonded sites nearest to the cysteines (Thennarasu & Nagaraj 
{1999) Biochem. Biophys, Res. Commun. 254:281-283). 

A final study describes several peptides from the loops of domain 1 of 
human CD4. In addition to a disulfide constraint, the authors have added exocyclic 
5 aromatic amino acids to the peptide termini. For example, a peptide covering CD4 
residues 39-44 was constrained as FCNQGSFLCY. No evidence for structure is 
given, but one cyclic peptide (FCYICEVEDQCY) was reported to antagonize both 
normal CD4 interactions and those involved in CD4-mediated cell entry by HIV 
(Zhang et ai (1996) Nature Biotechnology 14:472-475; Zhang et al (1997) Nature 
1 0 Biotechnology 1 S: 1 50- 1 54). 

IV. Peptide Libraries 

Many methods for generating peptide libraries that are known in the art can 
be used to generate the libraries of the invention. In one embodiment, members of 
the peptide library can be created by split-synthesis performed on a solid support 
15 such as polystyrene or polyacrylamide resin, as described by Lam et al (1991) 
Nature 354:82 and PCT publication WO 92/00091. In one aspect of the invention, 
the library of cyclic peptides can be prepared in which one or more of residues Al, 
A2, A4, and/or A5 are independently fixed and residues A3 are varied. 

A preferred method of generating the library of the present invention is 
20 phage display. In a phage display library, the cyclic peptide of the invention is fiised 
to at least a portion of a phage coat protein to form a fusion protein. The fusion 
protein can be made by expressing a gene fiision encoding the fusion protein using 
known techniques of phage display such as those described below. The fusion 
protein may form part of a phage or phagemid particle in which one or more copies 
25 of the cyclic peptide are displayed on the surface of the particle. A gene comprising 
a nucleic acid encoding the cyclic peptide or the fusion protein are within the scope 
of the invention. 

In another embodiment, the invention is a method comprising the steps of 
constructing a library containing a plurality of replicable expression vectors, each 
30 expression vector comprising a transcription regulatory element operably linked to a 
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gene fusion encoding a fusion protein, wherein the gene fusion comprises a first 
gene encoding a cyclic peptide of the invention and a second gene encoding at least 
a portion of a phage coat protein, where the library comprises a plurality of genes 
encoding variant cyclic peptide fusion proteins. Variant first genes and libraries 
5 thereof encoding variant cyclic peptides are prepared using known mutagenesis 
techniques described in more detail below. 

The invention also includes expression vectors comprising the fusion genes 
noted above, as well as a library of these vectors. The library of vectors may be in 
the form of a DNA library, a library of virus (phage or phagemid) particles 
10 containing the library of fusion genes or in the form of a library of host cells 
containing a library of the expression vectors or virus particles. 

Also within the invention is a method of selecting novel binding 
polypeptides comprising (a) constructing a Hbrary of variant replicable expression 
vectors comprising a transcription regulatory element operably linked to a gene 

15 fusion encoding a fusion protein wherein the gene fusion comprises a first gene 
encoding the cyclic peptide of the invention, and a second gene encoding at least a 
portion of a phage coat protein, where the variant expression vectors comprise 
variant first genes; (b) transforming suitable host cells with the vectors; (c) culturing 
the transformed host cells under conditions suitable for forming recombinant phage 

20 or phagemid virus particles containing at least a portion of the expression vector and 
capable of transforming the host, so that the particles display one or more copies of 
the fusion protein on the surface of the particle; (d) contacting the particles with a 
target molecule so that at least a portion of the particles bind to the target molecule; 
and (e) separating the particles that bind from those that do not. In the method of the 

25 invention, the phage coat protein is preferably the gene III or gene VIII coat protein 
of a filamentous phage such as Ml 3. Further, preferably the culturing of the 
transformed host cells is under conditions suitable for forming recombinant phage or 
phagemid particles where the conditions are adjusted so that no more than a minor 
amount of phage or phagemid particles display one or more copies of the fusion 

30 protein on the surface of the particle (monovalent display). 
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The invention also includes a method of introducing structural bias into a 
phage-displayed library, using steps (a) through (e) described above. The invention 
further includes a method of selecting beta hairpin forming peptide structures from a 
phage-displayed library, using steps (a) through (e) described above where the target 
5 is known to bind beta hairpin peptide structures, preferably a protein target known to 
so bind. 

Bacteriophage (phage) display is a known technique by which variant 
polypeptides are displayed as fusion proteins to the coat protein on the surface of 
bacteriophage particles (Scott, J.K. and Smith, G. P. (1990) Science 249: 386). The 

10 utility of phage display lies in the fact that large libraries of selectively randomized 
protein variants (or randomly cloned cDNAs) can be rapidly and efficiently sorted 
for those sequences that bind to a target molecule with high affinity. Display of 
peptide (Cwirla et aL (1990) Proc, Natl Acad. Sci, USA 87:6378) or protein 
(Lx)wman et al. (1991) Biochemistry 30:10832; Clackson et aL (1991) Nature 352: 

15 624; Marks et aL (1991), J. MoL BioL 222:581; Kang et aL (1991) Proc. NatL 
Acad. ScL USA 88:8363) libraries on phage have been used for screening millions of 
polypeptides for ones with specific binding properties (Smith, G. P. (1991) Current 
Opin, BiotechnoL 2:668). Sorting phage libraries of random mutants requires a 
strategy for constructing and propagating a large number of variants, a procedure for 

20 affinity purification using the target receptor, and a means of evaluating the results 
of binding enrichments. U.S. 5,223,409; U.S. 5,403,484; U.S. 5,571,689; U.S. 
5,663,143. 

Typically, variant polypeptides, such as the cyclic compounds of the 
invention, are fused to a gene III protein, which is displayed at one end of the virion. 
25 Alternatively, the variant polypeptides may be fused to the gene VIII protein, which 
is the major coat protein of the virion. Such polyvalent display libraries are 
constructed by replacing the phage gene III with a cDNA encoding the foreign 
sequence fused to the amino temiinus of the gene III protein. 

Monovalent phage display is a process in which a protein or peptide 
30 sequence is fused to a portion of a gene III protein and expressed at low levels in the 
presence of wild-type gene III protein so that particles display mostly wild-type gene 
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III protein and one copy or none of the fusion protein (Bass et ai (1990) Proteins 
8:309; Lowman, H.B. and Wells, J.A. (1991) Methods: a Companion to Methods in 
Enzymology 3:205). Monovalent display has the advantage over polyvalent phage 
display that progeny phagemid particles retain fiill infectivity. Avidity effects are 
5 reduced so that sorting is on the basis of intrinsic ligand afifinity, and phagemid 
vectors, which simplify DNA manipulations, are used. See also U.S. 5,750,373 and 
U.S. 5,780,279. Others have also used phagemids to display proteins, particularly 
antibodies. U.S. 5,667,988; U.S. 5,759,817; U.S. 5,770,356; and U.S. 5,658,727. 

Methods of generating peptide libraries and screening these libraries are also 
10 disclosed in U. S. 5,723,286; U. S. 5,432,018; U. S. 5,580,717; U. S. 5,427,908; and 
U. S. 5,498,530. See also U. S. 5,770,434; U. S. 5,734,018; U. S. 5,698,426; U. 
S.5,763,192; and U. S. 5,723,323. 

A two-step approach may be used to select high affinity ligands from peptide 
libraries displayed on Ml 3 phage. Low affinity leads are first selected from naive, 
15 polyvalent libraries displayed on the major coat protein (protein VUI). The low 
affmity selectants are subsequently transferred to the gene in minor coat protein and 
matured to high affinity in a monovalent format. 

Although most phage display methods have used filamentous phage, 
lambdoid phage display systems (WO 95/34683; U. S 5,627,024), T4 phage display 

20 systems (Ren et al (1998) Gene 215:439; Zhu (1997) CAN 33:534; Jiang et al 
(1997) C4A^ 128:44380; Ren et al (1997) CAN 127:215644; Ren (1996) Protein Sci, 
5:1833; Efimov et al. (1995) Virus Genes 10:173) and T7 phage display systems 
(Smith & Scott (1993) Methods in Enzymology 217:228-257; U. S. 5,766,905) are 
also known and can be used to create a library of the cyclic peptides of the 

25 invention. 

Suitable gene ni vectors for display of cyclic peptides of the invention 
include fUSE5 (Scott, J. K., and Smith G. P. (1990) Science 249:386-390); fAFFl 
(Cwirla et al. (1990). Proc. Natl Acad, Sci. U.S.A. 87:6378-6382); fd-CATl 
(McCafiferty et al. (1990) Nature (London) 348:552-554); m663 (Fowlkes et al. 
30 (1992) Biotechniques 13:422-427); fdtetDOG, pHENl (Hoogenboom et al. (1991) 
Nucleic Acids Res. 19:4133-4137); pComb3 (Gram et al. (1992) Proc. Natl. Acad. 
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ScL aS.A. 89:3576-3580); pCANTAB 5E (Pharaiacia); and LamdaSurfZap 
(Hogrefe (1993)Ge«e 137:85-91). 

Phage display methods for proteins, peptides and mutated variants thereof, 
including constructing a family of variant replicable vectors containing a 
5 transcription regulatory element operably linked to a gene fusion encoding a fusion 
polypeptide, transforming suitable host cells, culturing the transformed cells to form 
phage particles which display the fusion polypeptide on the surface of the phage 
particle, contacting the recombinant phage particles with a target molecule so that at 
least a portion of the particle bind to the target, separating the particles which bind 

10 from those that do not bind, are known and may be used with the method of the 
invention. See U.S. 5,750,373; WO 97/09446; U.S. 5,514,548; U.S. 5,498,538; U.S. 
5,516,637; U.S. 5,432,018; WO 96/22393; U.S. 5,658,727; U.S. 5,627,024; WO 
97/29185; O'Boyle et al (1997) Virology 236:338-347; Soumillion et al (1994) 
Appl Biochem. Biotech. 47:175-190; O'Neil and Hoess. (1995) Curr. Opin. Struct, 

15 Biol 5:443-449; Makowski (1993) Gene 128:5-11; Dunn (1996) Curr, Opin, Struct. 
Biol 7:547-553; Choo and Klug (1995) Curr, Opin. Struct. Biol 6:431-436; 
Bradbury & Cattaneo (1995) TINS 18:242-249; Cortese et al, (1995) Curr, Opin, 
Struct. Biol 6:73-80; Allen et al (1995) TIBS 20:509-516; Lindquist & Naderi 
(1995) FEMS Micro. Rev. 17:33-39; Clarkson & Wells (1994) Tibtech. 12:173-184; 

20 Barbas (1993) Curr. Opin, Biol 4:526-530; McGregor (1996) Mol Biotech. 6:155- 
162; Cortese et al (1996) Curr, Opin, Biol 7:616-621; McLafferty et al (1993) 
Gene 128:29-36. 

The gene encoding the coat protein of the phage and the gene encoding the 
desired cyclic polypeptide portion of the fusion protein of the invention (i.e., the 
25 cyclic peptide of the invention fused to at least a portion of a phage coat protein) can 
be obtained by methods known in the art (see generally, Sambrook et al). The DNA 
encoding the gene may be chemically synthesized (Merrfield (1963) J, Am. Chem, 
Soc. 85 :2149) and then mutated to prepare a library of variants as described below. 

To ligate DNA fragments together to form a functional vector containing the 
30 gene fusion, the ends of the DNA fragments must be compatible with each other. In 
some cases, the ends will be directly compatible after endonuclease digestion. 
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However, it may be necessary to first convert the sticky ends commonly produced 
by endonuclease digestion to blunt ends to make them compatible for ligation. To 
blunt the ends, the DNA is treated in a suitable buffer for at least 15 minutes at 15*C 
with 10 units of the Klenow fi-agment of DNA polymerase I (Klenow) in the 
5 presence of the four deoxynucleotide triphosphates. The DNA is then purified by 
phenol-chloroform extraction and ethanol precipitation or other DNA purification 
technique. 

The cleaved DNA fi-agments may be size-separated and selected using DNA 
gel electrophoresis. The DNA may be electrophoresed through either an agarose or 
10 a polyacrylamide matrix. The selection of the matrix will depend on the size of the 
DNA fiagments to be separated. After electrophoresis, the DNA is extracted fi-om 
the matrix by electroelution, or, if low-melting agarose has been used as the matrix, 
by melting the agarose and extracting the DNA fi"om it, as described in sections 
6.30-6.33 of Sambrook et al 

15 The DNA fi-agments that are to be ligated together (previously digested with 

the appropriate restriction enzymes such that the ends of each fi-agment to be ligated 
are compatible) are put in solution in about equimolar amounts. The solution will 
also contain ATP, ligase buffer and a ligase such as T4 DNA ligase at about 10 units 
per 0.5 Mg of DNA. If the DNA fi-agment is to be ligated into a vector, the vector is 

20 at first linearized by cutting with the appropriate restriction endonuclease(s). The 
linearized vector is then treated with alkaline phosphatase or calf intestinal 
phosphatase. The phosphatasing prevents self-ligation of the vector during the 
ligation step. 

After ligation, the vector with the foreign gene now inserted is purified and 
25 transformed into a suitable host cell. A preferred transformation method is 

electroporation. Electroporation may be carried out using methods known in the art 

and described, for example, in U.S. 4,910,140; U.S. 5,186,800; U.S. 4,849,355; U.S. 

5,173,158; U.S. 5,098,843; U.S. 5,422,272; U.S. 5,232,856; U.S. 5,283,194; U.S. 

5,128,257; U.S. 5,750,373; U.S. 4,956,288 or any other known batch or continuous 
30 electroporation process. More than one (a plurality) electroporation may be 

conducted to increase the amount of DNA which is transformed into the host cells. 
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Repeated electroporalions are conducted as described in the art. See Vaughan et al 
(1996) Nature Biotechnology 14:309-314. The number of additional 
electroporalions may vary as desired from several (2,3,4,... 10) up to tens (10, 20, 
30,...100) and even hundreds (100, 200, 300,...1000). Repeated electroporations 
5 may be desired to increase the size of a combinatorial library, e.g. an antibody 
library, transfonned into the host cells. 

Preferably, for library construction, the DNA is present at a concentration of 
25 micrograms/mL or greater. More preferably, the DNA is present at a 
concentration of about 30 micrograms/mL or greater, more preferably at a 

10 concentration of about 70 micrograms/mL or greater and even more preferably at a 
concentration of about 100 micrograms/mL or greater even up to several hundreds 
of micrograms/mL. Generally, the electroporation will utilize DNA concentrations 
in the range of about 50 to about 500 micrograms/mL. A time constant during 
electroporation greater than 3.0 milliseconds (ms) results in a high transformation 

15 efficiency. 

The DNA is preferably purified to remove contaminants. The DNA may be 
purified by any known method, however, a preferred purification method is the use 
of DNA affinity purification. The purification of DNA, e.g., recombinant plasmid 
DNA, using DNA binding resins and affinity reagents is well known and any of the 

20 known methods can be used in this invention (Vogelstein, B. and Gillespie, D. 
(1979) Proc, Natl Acad, Sci. USA 76:615; Callen, W. (1993) Strategies 6:52-53). 
Commercially available DNA isolation and purification kits are also available from 
several sources including Stratagene (CLEARCUT Miniprep Kit), and Life 
Technologies (GLASSMAX DNA Isolation Systems). Suitable nonlimiting 

25 methods of DNA purification include column chromatography (U.S. 5,707,812), the 
use of hydroxylated silical polymers (U.S. 5,693,785), rehydrated silica gel (U.S. 
4,923,978), boronated silicates (U.S. 5,674,997), modified glass fiber membranes 
(U.S. 5,650,506; U.S. 5,438,127), fiuorinated adsorbents (U.S. 5,625,054; U.S. 
5,438,129), diatomaceous earth (U.S. 5,075,430), dialysis (U.S. 4,921,952), gel 

30 polymers (U.S. 5,106,966) and the use of chaotropic compounds with DNA binding 
reagents (U.S. 5,234,809). After purification, the DNA is eluted or otherwise 
resuspended in water, preferably distilled or deionized water, for use in 
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electFoporation at the concentrations of the invention. The use of low salt buffer 
solutions is also contemplated. 

Any suitable cells which can be transformed by electroporation may be used 
as host cells in the method of the present invention. Suitable host cells which can be 
5 transformed include gram negative bacterial cells such as E. colL Suitable E, coli 
strains include JMlOl, E. coli K12 strain 294 (ATCC number 31,446), E. coli 
stram W3110 (ATCC number 27,325). E. coli X1776 (ATCC number 31,537), E, 
coli XL- 1 Blue (Stratagene), and £. co/i B; however many other strains of E. coli, 
such as XLl-Blue MRF', SURE, ABLE C, ABLE K, WMllOO, MC1061, HBlOl, 
10 CJ136, MV1190, JS4, JS5, NM522, NM538, and NM539, may be used as well. 
Cells are made competent using known procedures. Sambrook ei al, above, L76- 
1.81, 16.30. 

10 

Cell concentrations of about 10 colony forming units (cfu)/mL) of viable 
living cells and greater are preferably used for electroporation. More preferably, the 

15 viable cells are concentrated to about 1 x 10^^ to about 4 x 10^* cfli/mL. Preferred 
cells which may be concentrated to this range are the SS320 cells described below. 
Cells are preferably grown in culture in standard culture broth, optionally for about 
6-48 hrs (or to ODm = 0.6 - 0,8) at about 37''C, and then the broth is centrifuged 
and the supernatant removed (e.g. decanted). Initial purification is preferably by 

20 resuspending the cell pellet in a buffer solution (e.g. HEPES pH 7.4) followed by 
recentrifugation and removal of supernatant. The resulting cell pellet is resuspended 
in dilute glycerol (e.g. 5 - 20% v/v) and again centrifuged to form a cell pellet and 
the supernatant removed. The final cell concentration is obtained by resuspending 
the cell pellet in water or dilute glycerol to the desired concentration. 

25 A particularly preferred recipient cell for the electroporation is a competent 

£. coli strain containing a phage F' episome. Any F' episome which enables phage 
replication in the strain may be used in the invention. Suitable episomes are 
available firom strains deposited with ATCC or are conunercially available (CJ236, 
CSH18, DHSalphaF', JMlOl, JM103, JM105, JM107, JM109, JMllO), KSIOOO, 

30 XLl-BLUE, 71-18 and others ), Strain SS320 was prepared by mating MC1061 
cells with XLl-BLUE cells under conditions sufficient to transfer the fertility 
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episome (F' plasmid) of XLl-BLUE into the MCI 061 cells. In general, mixing 
cultures of the two cell types and growing the mixture in culture medium for about 
one hour at 3VC is sufficient to allow mating and episome transfer to occur. The 
new resuhing £. coli strain has the genotype of MC1061 which carries a 
5 streptomycin resistance chromosomal marker and the genotype of the F' plasmid 
which confers tetracycline resistance. The progeny of this mating is resistant to both 
antibiotics and can be selectively grown in the presence of streptomycin and 
tetracycline. Strain SS320 has been deposited with the American Type Culture 
Collection (ATCC), 10801 University Boulevard, Manassas, Virginia, USA on June 
10 1 8, 1998 and assigned Deposit Accession No. 98795. 

This deposit of strain SS320 was made under the provisions of the Budapest 
Treaty on the Intemational Recognition of the Deposit of Microorganisms for the 
Purpose of Patent Procedure and the Regulations thereunder (Budapest Treaty). 
This assures maintenance of a viable culture for 30 years firom the date of deposit. 

15 The organisms will be made available by ATCC under the terms of the Budapest 
Treaty, and subject to an agreement between Genentech, Inc. and ATCC, which 
assures permanent and imrestricted availability of the progeny of the cultures to the 
public upon issuance of the pertinent U.S. patent or upon laying open to the public 
of any U.S. or foreign patent application, whichever comes first, and assures 

20 availability of the progeny to one determined by the U.S. Commissioner of Patents 
and Trademarks to be entitled thereto according to 3S USC §122 and the 
Commissioner's rules pursuant thereto (including 37 CFR §1.14 with particular 
reference to 886 OG 638). 

The assignee of the present application has agreed that if the cultures on 
25 deposit should die or be lost or destroyed when cultivated under suitable conditions, 
they will be promptly replaced on notification with a viable specimen of the same 
culture. Availability of the deposited cultures is not to be construed as a license to 
practice the invention in contravention of the rights granted under the authority of 
any government in accordance with its patent laws. 



30 Oligonucleotide-mediated mutagenesis is a preferred method for preparing 

the substitution, deletion, and insertion variants of the invention. This technique is 
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well known in the art as described by Zoller et ai (1987) Nucleic Acids Res. 10: 
6487-6504. Briefly, a gene encoding a protein fusion or heterologous polypeptide is 
altered by hybridizing an oligonucleotide encoding the desired mutation to a DNA 
template, where the template is the single-stranded form of the plasmid containing 
5 the unaltered or native DNA sequence of the gene. After hybridization, a DNA 
polymerase is used to synthesize an entire second complementary strand of the 
template which will thus incorporate the oligonucleotide primer, and will code for 
the selected alteration in the gene. Generally, oligonucleotides of at least 25 
nucleotides in length are used. An optimal oligonucleotide will have 12 to 15 
10 nucleotides that are completely complementary to the template on either side of the 
nucleotide(s) coding for the mutation. This ensures that the oligonucleotide will 
hybridize properly to the single-stranded DNA template molecule. The 
oligonucleotides are readily synthesized using techniques known in the art such as 
that described by Crea et aL (1978) Proc, Natl Acad, Sci. USA 75: 5765. 

15 The DNA template is generated by those vectors that are derived from the 

bacteriophage used in the phage display system, e.g. bacteriophage Ml 3 vectors (the 
commercially available MlSmplS and M13mpl9 vectors are suitable), or those 
vectors that contain a single-stranded phage origin of replication; examples are 
described by Viera et aL (1987) Meth. Enzymol 153:3. Thus, the DNA that is to be 

20 mutated can be inserted into one of these vectors in order to generate single-stranded 
template. Production of the single-stranded template is described in sections 4.21- 
4.41 of Sambrook et al 

To alter the native DNA sequence, the oligonucleotide is hybridized to the 
single stranded template under suitable hybridization conditions. A DNA 

25 polymerizing enzyme, usually T7 DNA polymerase or the Klenow fragment of 
DNA polymerase I, is then added to synthesize the complementary strand of the 
template using the oligonucleotide as a primer for synthesis. A heteroduplex 
molecule is thus formed such that one strand of DNA encodes the mutated form of 
the gene, and the other strand (the original template) encodes the native, unaltered 

30 sequence of the gene. This heteroduplex molecule is then transformed into a 
suitable host cell, usually a prokaryote such as E, Coli JMIOI. After growing the 
cells, they are plated onto agarose plates and screened using the oligonucleotide 
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primer radiolabelled with 32-Phosphate to identiiy the bacterial colonies that contain 
the mutated DNA. 

The method described immediately above may be modified such that a 
homoduplex molecule is created wherein both strands of the plasmid contain the 
5 mutation(s). The modifications are as follows: The single-stranded oligonucleotide 
is annealed to the single-stranded template as described above. A mixture of three 
deoxyribonucleotides, deoxyriboadenosine (dATP), deoxyriboguanosine (dGTP), 
and deoxyribothymidine (dTTP), is combined with a modified thio- 
deoxyribocytosine called dCTP-(aS) (which can be obtained fi-om Amersham). 

10 This mixture is added to the template-oligonucleotide complex. Upon addition of 
DNA polymerase to this mixture, a strand of DNA identical to the template except 
for the mutated bases is generated. In addition, this new strand of DNA will contain 
dCTP-(aS) instead of dCTP, which serves to protect it fi-om restriction endonuclease 
digestion. After the template strand of the double-stranded heteroduplex is nicked 

15 with an appropriate restriction enzyme, the template strand can be digested with 
ExoIII nuclease or another appropriate nuclease past the region that contains the 
site(s) to be mutagenized. The reaction is then stopped to leave a molecule that is 
only partially single-stranded. A complete double-stranded DNA homoduplex is 
then formed using DNA polymerase in the presence of all four deoxyribonucleotide 

20 triphosphates, ATP, and DNA ligase. This homoduplex molecule can then be 
transformed into a suitable host cell such as E, coli JMlOl, as described above. 

Mutants with more than one amino acid to be substituted may be generated 
in one of several ways. If the amino acids are located close together in the 
polypeptide chain, they may be mutated simultaneously using one oligonucleotide 
25 that codes for all of the desired amino acid substitutions. If, however, the amino 
acids are located some distance fi-om each other (separated by more than about ten 
amino acids), it is more difficult to generate a single oligonucleotide that encodes all 
of the desired changes. Instead, one of two alternative methods may be employed. 

In the first method, a separate oligonucleotide is generated for each amino 
30 acid to be substituted. The oligonucleotides are then annealed to the single-stranded 
template DNA simultaneously, and the second strand of DNA that is synthesized 
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from the template will encode all of the desired amino acid substitutions. The 
alternative method involves two or more rounds of mutagenesis to produce the 
desired mutant. The first round is as described for the single mutants: wild-type 
DNA is used for the template, an oligonucleotide encoding the first desired amino 
5 acid substitution(s) is annealed to this template, and the heteroduplex DNA molecule 
is then generated. The second round of mutagenesis utilizes the mutated DNA 
produced in the first round of mutagenesis as the template. Thus, this template 
already contains one or more mutations. The oligonucleotide encoding the 
additional desired amino acid substitution(s) is then annealed to this template, and 
10 the resulting strand of DNA now encodes mutations from both the first and second 
rounds of mutagenesis. This resultant DNA can be used as a template in a third 
round of mutagenesis, and so on. 

Cassette mutagenesis is also a preferred method for preparing the 
substitution, deletion, and insertion variants of the invention. The method is based 

15 on that described by Wells et ai (1985) Gene 34:315. The starting material is a 
plasmid (or other vector) containing the gene to be mutated. The codon (s) in the 
gene to be mutated are identified. There must be a unique restriction endonuclease 
site on each side of the identified mutation site(s). If no such restriction sites exist, 
they may be generated using the above-described oligonucleotide-mediated 

20 mutagenesis method to introduce them at appropriate locations in the gene. After 
the restriction sites have been introduced into the plasmid, the plasmid is cut at these 
sites to linearize it. A double-stranded oligonucleotide encoding the sequence of 
the DNA between the restriction sites but containing the desired mutation(s) is 
synthesized using standard procedures. The two strands are synthesized separately 

25 and then hybridized together using standard techniques. This double-stranded 
oligonucleotide is referred to as the cassette. This cassette is designed to have 3* and 
5' ends that are compatible with the ends of the linearized plasmid, such that it can 
be directly ligated to the plasmid. This plasmid now contains the mutated DNA 
sequence of the gene. Vectors containing the mutated variants can be transformed 

30 into suitable host cells as described above. 
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The transformed cells are generally selected by growth on an antibiotic, 
commonly tetracycline (tet) or ampicillin (amp), to which they are rendered resistant 
due to the presence of tet and/or amp resistance genes in the vector. 

Suitable phage and phagemid vectors for use in this invention include all 
5 known vectors for phage display. Additional examples include pCombS (Gram et 
al (1992) Proc, Natl Acad. ScL USA 89:3576-3580); pC89 (Felici et al (1991) J, 
MoL Biol 222:310-310); pIF4 (Bianchi et al (1995) J. Mol Biol 247:154-160); 
PM48. PM52, and PM54 (lannolo. (1995) J. Mol Biol 248:835-844); fdH 
(Greenwood et al (1991) /. Mol Biol 220:821-827); pfd8SHU, pfd8SU, pfdSSY, 

10 and fdISPLAY8 (Malik & Perham (1996) G^/ie 171:49-51); "88" (Smith (1993) 
Gene 128:1-2); f88.4 (Zhong et al (1994) / Biol Chem, 269:24183-24188); p8V5 
(Affymax); MBl, MB20, MB26, MB27, MB28, MB42, MB48, MB49, MB56: 
(Markland et al (1991) Gene 109:13-19). Similarly, any known helper phage may 
be used when a phagemid vector is employed in the phage display system. Examples 

15 of suitable helper phage include M13-K07 (Pharmacia), M13-VCS (Stratagene), 
and R408 (Stratagene). 

After selection of the transformed cells, these cells are grown in culture and 
the vector DNA may then be isolated. Phage or phagemid vector DNA can be 
isolated using methods known in the art, for example, as described in Sambrook et 
20 a/., Molecular Cloning: A Laboratory Manual, 2nd edition, (1989) Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, NY. 

The isolated DNA can be purified by methods known in the art such as that 
described in section 1.40 of Sambrook et al^ above and as described above. This 
purified DNA can then be analyzed by DNA sequencing. DNA sequencing may be 
25 performed by the method of Messing et al (1981) Nucleic Acids Res, 9:309 , the 
method of Maxam et al (1980) Meth. Enzymol 65:499 , or by any other known 
method. 

V. Applications 

The various aspects and embodiments of the present invention demonstrate 
30 the advantages of a novel model system for rationally designing and analyzing 

-38- 



wo 2004/035735 



PCTAJS2003/032450 



peptides of defined structural features. The combinatorial libraries comprising such 
peptides and methods of using thereof provide useful infonnation and tools for 
exploring the basic structure-activity relationships involved in almost all biological 
molecular interactions. The peptides disclosed herein or generated according to the 
5 disclosure of the invention can be candidates for various biological or therapeutic 
agents, including but not limited to, enzyme inhibitors, ligand antagonists, ligand 
agonists, toxins, and immunogens. 

The following examples are provided by way of illustration and not by way 
of limitation. All disclosures of the references cited herein are expressly 
10 incorporated herein by reference in their entirety. 

EXAMPLES 

Example 1: Design of a Structured, Disulfide-constrained p-hairpin Peptide 
Scaffold 

In this example, we chose to investigate disulfide-constrained P-hairpins of 
15 the decamers in the form of CXgC as scaffolds for P-tum display. For our purpose, 
it is essential to design a structure compatible with many turn sequences. That is, 
residues other than those in the turn must significantly bias the peptide toward 
hairpin structure. Disulfide cyclization is helpful, although not sufficient to structure 
many peptides. Our initial objective was to determine whether the disulfide bond 
20 could be used not only as a covalent constraint, but also to nucleate a more extended 
interaction of the p-strands. 

Materials and Methods 

Peptide Synthesis. Peptides were synthesized using standard Fmoc 
chemistry on a Pioneer synthesizer (PE Biosystems), cleaved from resin with 5% 
25 triisopropylsilane in trifluoroacetic acid (TFA), and purified by reversed-phase 
HPLC (acetonitrile/HjO/O.P/o TFA). Peptide identity was confirmed by mass 
spectrometry. Peptides were converted to cyclic disulfides by dropwise addition of a 
saturated solution of I2 in acetic acid and repurified by HPLC. Purified peptides 
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eluted as single symmetric peaks on CI 8 analytical columns (0-40% acetonitrile in 
40 minutes). 

Cysteine Effective Concentration Measurements. Glutathione stock 
solutions were prepared by mixing 3 volumes of 0.2 M reduced glutathione (GSH) 
5 with 1 volume of 0.1 M oxidized glutatione (GSSG). Aliquots were stored at - 80°C 
and were stable for several months; use of a single batch eliminated any error in 
AAG values that might arise from variability of total glutathione concentration. 
Thiol-disulfide equilibria were established by mixing SO peptide stock 
(approximately 3 mM in water) with SO glutathione stock, deoxygenating the 

10 acidic solution with vacuum/argon cycles from a Firestone valve, then adding 300 
\iL of deoxygenated buffer by syringe (0.2 M tris, pH 8.0; 1 mM EDTA; 67 mM tris 
base to titrate glutathione), followed by further deoxygenation of the mixture. The 
final pH of all reaction mixtures was 8.10 ±0.05. Solutions were stirred under 
argon and maintained at 20*^0 in a water bath. After 1.5 h, successive aliquots 

15 (100 ^iL) were removed with a gastight syringe, immediately quenched by discharge 
into 400 ^iL of 31 mM HCl, and analyzed by HPLC with a minimum of delay. C^^^ 
values were calculated from the molar ratios of the reduced and oxidized forms of 
peptide and glutathione (peak area ratios corrected for absorbance differences 
measured by HPLC), assuming 0.025 M total glutathione monomer (i. e., neglecting 

20 the minor amount (< 1 %) of glutathione present in mixed disulfides with peptide): 

Ceff = ([peptideox] / [peptidered]) x ([GSH]2 / [GSSG]) 
[GSH] + 2[GSSG] = 0.025 M 

[GSSG] = 0.025 M / {2 + 3.26 (GSH peak area / GSSG peak area)} 
[peptideox] / [peptidered] = equilibrium peak area ratio / absorbance 

25 ratio 

Two or three samples from each reaction mixture were analyzed; there were 
no shifts in populations with time, and calculated Ceff values typically varied by less 
than 5% (equivalent to 30 cal/mol uncertainty in AAG). 
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NMR Spectroscopy. NMR samples contained S-10 mM peptide in 92% 
H20/8% D2O pH 5.1 and 0.1 mM 1,4-dioxane as chemical shift reference. All 
spectra were acquired on a Broker DRX-500 or a Varian Umty-400 spectrometer at 
15T. 2QF-C0SY, TOCSY and ROESY spectra were acquired as described 
5 (Cavanagh et al (1995) Protein NMR Spectroscopy, Principles and Practices 
(Academic Press, San Diego) with gradient coherence selection (van Zijl et al 
(1995) J. Magn, Reson. 113A:265-270) or excitation sculpting (Hwang & Shaka, 
(1995) y. Magn. Reson. 112A, 275-279) for water suppression. Proton resonances 
were assigned by standard methods (Wiithrich (1986) NMR of Proteins and Nucleic 

10 Acids (John Wiley and Sons, New York). Vh^-H^ were obtained by fitting 
Lorentzian lines to the antiphase doublets of H^-H® peaks in 2QF-C0SY spectra 

processed to high digital resolution in F2> -^-^H^-H^ were extracted from COSY-35 
spectra acquired on D2O solutions of the peptides. Distance and dihedral angle 
restraints were generated as described (Skelton et al (1994) Biochemistry 33:13581- 

15 13592). 100 initial structures were calculated using the hybrid distance 
geometry/simulated annealing program DGII (Havel et al (1991) Prog, Biophys, 
Mol Biol 56:43-78.); 80 of these were further refined by restrained molecular 
dynamics using the AMBER all-atom forcefield implemented in DISCOVER as 
described previously (Skelton et al (1994) Biochemistry 33:13581-13592.). 20 

20 conformations of lowest restraint violation energy were chosen to represent the 
solution conformation of each peptide. 

Structure Calculation. Structures were calculated with 78 ROE-derived 
distance restraints (10 medium- and 28 long-range restraints; upper bounds of 5.4, 
4.3, 3.4 or 3.0 A) and 12 dihedral angle restraints. The final 20 structures had 
25 average maximum violation of distance and dihedral angle restraints of 
0.05 ± 0.02 A and 0.7 ± 0.2^ respectively; RMS deviation from the experimental 
distance and dihedral angle restraints were 0.007 ±0.002 A and 0,29 ±0.08°, 

respectively. The mean RMSD from the mean structure is 0.28 ± 0.04 A for N, C , 
and C atoms of residues Cysl-CyslO whilst 75% of residues had O, 4^values in the 
30 most favored portions of the Ramachandran plot (none were in the disallowed or 
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generously allowed region) (Laskowski et ai (1993) J. Appl Crystallogr. 26:283- 
291.). 

NMR Analysis. NMR samples of CD4 peptides contained -2 mM peptide 
in 92% H20/8% DjO, pH 3.5 with 50 ^iM 3-(trimethylsilyl)-l-propane-7,7,2,2,i,i,- 
5 tf^-sulfonic acid (DSS) as a chemical shift reference. Spectra were acquired and 
analyzed as described above. The structure of cd2 was calculated from 84 
(including 13 medium- and 23 long-range) ROE-derived distance restraints and 13 
dihedral angle restraints. The average maximum violations of distance and dihedral 
angle restraints are 0.05 ± 0.01 A and 0.6 ± 0.4% respectively; the KMSDs from the 
10 experimental distance and dihedral angle restraints are 0.009 ± 0.002 A and 0.2 ± 
O.r, respectively. The covalent geometry is good, with 74% of the O, ^ angles 
within the most favored and none in the disallowed or generously allowed regions of 
the Ramachandran plot (Laskowski ei a/.(1993) J. Appl Crystallogr, 26:283-291). 

Analysis of Sidechain Rotamers* Observation of both V^a-Hp^ ^Ai^- 
15 j^p2 in the range of 6-9 Hz indicates that a side chain does not occupy a single 
classical rotamer (xl - - 60°, + 60° or 1 80°), and most likely samples all three 
staggered rotamer wells. This is the situation for Trp3 and Leu8 of bhpW and Gln4, 
Phe7 and Leu8 of cd2. ROE peaks observed to these side chains represent a time- 
average over the range of xl values sampled. Not all of these conformations will 
20 give rise to readily observable ROE peaks, hence the structure calculation process 
will be biased towards those rotamers for which restraints could be obtained. For 
example, ROEs from Phe7 of cd2 are observed to protons in the opposite strand, 
thereby forcing Phe7 to lie in the + 60° rotamer well. Given the large number of 
backbone-backbone distance and O dihedral angle restraints, the structures 
25 calculated for bhpW and cd2 do accurately represent the solution conformation of 
these peptides, except in the over determination of some side chain orientations. 

Results 

In our survey of B-sheets from a set of 928 non-redundant protein structures, 
the mean C ^-C ^ distances between hydrogen-bonded and non-hydrogen bonded 
30 pairs of residues in adjacent strands were 4.82 ±0.58 and 5.37 ±0.56 A, 
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respectively, while the average C ^-C ^ distance in disulfide-bonded cysteines was 
3.84 A. Therefore, the C ^ atoms of opposing residues on antiparallel strands are 
normally too far apart for disulfide bond formation. Nonetheless, disulfide 
crosslinks are sometimes found between cysteines in the non-hydrogen-bonding 
5 register in B-sheets. We found 23 disulfide-bonded cysteine pairs joining adjacent 
antiparallel strands. In 14 of 23 cases, the disulfide packs tightly against the 
hydrophobic sidechain two residues before one (or both) of the cysteines (Fig. la). 
In 5 additional cases this hydrophobic site was occupied by a polar or charged 
residue with P and y-methylenes (E, Q, or R). In particular, the sidechains of either 

10 leucine or an aromatic amino acid provided good shape complementarity to this 
characteristic disulfide conformation. Accordingly, we chose leucine as residue 8 in 
our model peptide (Fig. lb), included threonines at positions 2 and 9 to promote an 
extended backbone conformation, and chose the turn sequence EGNK as a 
representative, but not overly strong, type 11' p-tum. To determine the best cross- 

1 5 strand pairing with leucine, position 3 was varied. 

The presence of a disulfide provides a convenient probe for hairpin stability. 
Thiol-disulfide equilibria were measured relative to the reference thiol glutathione, 
yielding effective concentrations (C^^) for the cysteine pairs. Larger values of C^^ 
indicate an increased proximity, on average, of the cysteine thiols, consistent with 

20 formation of the hairpin structure. This method has been used to evaluate the effect 
of residue substitutions in a 13-tum on hairpin stability. Stroup and Gierasch (1990) 
Biochemistry 29:9765-9771, Peptide C^jy values varied significantly for different 
residues at position 3 (Fig. 2a). Strikingly, tryptophan at position 3 strongly shifted 
the peptide equilibrium toward the oxidized form: this behavior was not caused by 

25 peptide aggregation. Scaling of the C^^ values to that of the alanine analog (-RT 
^"{^en-, X ^ ^cfT, aia}) yi^l^s &CC energy differences spanning > 0.8 kcal/mol (Fig. 2b) 
that can be interpreted as the relative tendencies of the peptides to fold. These data 
do not, however, distinguish between effects on the folded and unfolded states of the 
peptide. For example, a given substitution might promote favorable side chain 

30 packing in the oxidized peptide, or simply bias toward an extended backbone 
conformation in the reduced peptide. 
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To assess whether the peptides were indeed forming p-hairpins» several of 
them were evaluated by *H NMR spectroscopy (Table 1). The tryptophan peptide 
(bhpW) exhibited all the hallmarks of a highly populated P-hairpin in terms of 

a N 

intense sequential H -H NOEs, numerous backbone cross-strand NOEs and large 
5 backbone scalar coupling constants (^Jh^-H** > 8.0 Hz) for strand residues. The If* 
chemical shifts for Cysl and CyslO were downfield relative to values observed in 
unstructured peptides, indicating that the antiparallel strands encompass these 
tierminal residues. The other peptides studied were judged to have a lower 
population of hairpin structure (see Table 1). Interestingly, the NMR data correlate 
10 well with C^fj. (Table 1); thus, the disulfide exchange assay provides a useful 
quantitation of the degree of hairpin structure in the oxidized peptides. 
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Table 1. Comparison of cysteine effective concentrations (Ceff) and NMR 
data for selected model hairpin peptides 



Residue 3 
(X.Fig. lb) 




No. of3yHN.H">SHz 


6(Cysl Htt),ppm 


6(CyslOH<»), ppm 


Tip bhpW 


210±4 


7 


5.20 


5.00 


Tyr 


98±2 


7 


5.07 


4.91 


Phe 


88d:0 


5 


5.07 


4.92 


Leu 


85±l 


6 


5.04 


4.89 


Val 


73 ±0 


4 


4,97 


4.85 


Lys 


52±2 


3 


4.92 


4.82 


Asn 


52 il 


3 


4.84 


4.76 


random coil 




0 


4.71 


4.71 



The maximum number of strand residues with ^.^N.h**> 8 Hz is 8; for the 



5 tryptophan peptide (bhpW), the LeuS coupling constant is 7.9 Hz. Random coil 
coupling constants are taken from Smith et al (Smith et al. (1996) J. Mol Biol. 

225:494-506). Random coil VP- chemical shifts are taken from Wishart et al 
(Wishart et ai (1992) Biochemistry 31:1647-1651). 

10 Example 2: Transfer of Alternative Tetrapeptide Turn Sequences onto the Hairpin 
Scaffold 

Structures calculated for bhpW according to Example 1 revealed a well 
formed antiparallel hairpin with a type ir turn (Gly5-Asn6), and hydrophobic 
contacts between the side chains of Cysl, Trp3, LeuS and CyslO (Fig. 3). 

15 Thermodynamic analysis of bhpW stability was complicated by the failure of the 
oxidized peptide to unfold fully, either at high temperature or in the presence of 
chemical denaturants. Nevertheless, we estimate the hairpin conformation to be 
highly populated, most likely > 80%, at 15 °C. Because of its structural stability, we 
have chosen bhpW for investigation as a turn display scaffold. Accordingly we 

20 tested whether a different turn sequence could be structured by the bhpW strand 
sequences. 
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A recent crystal structure of HIV gpl20 bound to a neutralizing antibody and 
to hiunan CD4 revealed details of the contact surfaces (Kwong et al (1998) Nature 
393:648-659.). As had been anticipated from numerous mutagenesis studies, the 
CD4 region most important for gpl20 binding is the C'-C" hairpin loop (residues 37- 
5 46), with the critical Phe43 side chain extending from the protein surface. In fact, 
CD4 residues 40-48 contribute 63% of the surface area buried in the interface, with 
23% of the total contributed by Phe43 (Kwong et al (1998) Nature 393:648-659.). 
Unexpectedly, there is a large cavity in gpl20, behind the Phe43 binding site, that is 
lined with hydrophobic residues. It seemed possible that a structured peptide based 
10 on the C'-C" turn might bind to gpl20 and if so, might be a starting point for 
designing ligands that extend into the cavity seen in the crystal structure. 

We synthesized a disulfide-constrained peptide based on the native sequence 
of the CD4 hairpin (residues 38-45, cdl in Table 2) and found it to be essentially 
unstructured in solution (Fig. 4a). We then made the substitutions G2T and N3W, to 
15 match the corresponding residues in bhpW (cd2, Table 2); residues L8 and T9 are 
already present in the native CD4 sequence. Peptide cd2 is well-ordered, adopting a 
hairpin structure with a type 11* turn (Fig. 4). In Figure 4A, peak assignments for 
cd2 are shown; arrows indicate the location of the corresponding crosspeak in cdl. 

Those cd2 residues with Vh^-h" > 8.3 Hz are underlined; all cdl residues have 
20 backbone coupling constants between 5.9 and 7.7 Hz. From the measured Ceff 
values, the CD4 turn (QGSF) destabilizes the model hairpin (EGNK turn) by 0.5 
kcal/mol, and we found that both the T2 and W3 substitutions were necessary for 
stable hairpin structure (Table 2). Importantly, comparison of the peptide structure 
with that of CD4 indicates that the backbone conformations are essentially the 
25 same, within the uncertainties of the structure determinations (0.93 A RMSD; Figure 
4B). Figure 4B shows the NMR structure ensemble for cd2 (20 models; two 
orthogonal views) shown superimposed on CD4 residues 37-46 (red) from the 
crystal structure of gpl20-bound CD4. The RMSD for the 20 models, with respect 
to the mean coordinates for the backbone atoms of residues 1-10, is 0.50 ± 0.09 A; 
30 comparison of the mean coordinates of residues 1-10 with residues 37-46 of CD4 

from the crystal structure yields an RMSD of 0.93 A. Note that the ^J^:^^ 
coupling constants for Phe7 of cd2 indicate that this sidechain is not fixed in the 
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rotamer seen in the ensemble; the Phe7 sidechain adopts multiple conformations in 
solution, undoubtedly sampling that observed in the co-crystal structure. This 
demonstrates that the peptide scaffold correctly presents the CD4 p-tum. 

Table 2. Comparison of bhpW and peptides based on the CD4 C*-C*' 

5 loop 



Peptide Ceff [91215 No.of^HN. 6(CysNH«) 5(CyscHa) 

(mM) (degcm^ Ha>8H2 (ppm) (ppm) 
dmoK ) 

AC-CTWEGNKLTC-NH2 bhpW 2 1 0 ± 4 ■ 1 9.800 7 120 
(SEQIDNO: 2) 

SCTWEGNKLTCK.NH2 273 ±2 - 17,400 n. d. n.d. n. d. 

(SEQroNO: 3) 

AC-CGNQGSFLTC-NH2 cdl n. d. n. d. 0 4.66 4.66 

(SEQ ID NO:4) 

AC-CIWQGSFLTC-NH2 crf2 n.d. - 15,800 6 5.08 4.93 

(SEQIDNO:5) 

SCGNQGSFLTCK-NH2 c^/fl 45 ± 4 - 1,500 0 4.80 4J2 

(SEQIDNO:6) 

SCINQGSFLTCK-NH2 n.d. -5,000 2 4.96 4.79 

(SEQIDN0:7) 

SCGWQGSFLTCK-NH2 48 ± 0 - 6,100 3 5.00 4.88 

(SEQIDN0:8) 

SCIWQGSFLTCK-NH2 c(/2a 120±0 - 14,000 6 5.36 5.14 

(SEQ ID NO:9) 

n.d.: not determined; CysN H^: chennical shift for the more N-terminal cysteine (Cysl 
or Cys2); Cysc H**: chemical shift for the more C-terminal cysteine (CyslO or Cysl 1). 



Terminal serine and lysine residues were added to improve the solubility of 
10 some variants of the CD4 peptide, which are otherwise uncharged. A similar 

modification was made to bhpW as a control. Non-turn residues that differ between 
bhpW and the CD4 loop are underlined. Coelution of reduced and oxidized peptides 
prevented measurement of Ceff for the T2, N3 variant of the CD4 peptide. Circular 
dichroism spectra were acquired at 10 'C with an Aviv Instruments, Inc. Model 202 
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Spectrophotometer; peptide concentrations were 20 \iM in 20 mM potassium 
phosphate, pH 7.0. 

Two other turn sequences evaluated were VWQL from the F-G loop of 
domain 2 of human Fc-epsilon-RI, and GPLT from the EPO agonist peptide EMPl. 
5 All three turns were evaluated in the trp peptide scaffold and in cyclized peptides 



whose sequence matched more closely the native parent hairpin loops: 

SCGNQGSFLTCK-NH2 (SEQ ID NO: 1 0) CD4 peptides a 

SCTWQGSFLTCK.NH2 (SEQ ID NO: 1 1) b 

AC-CTKVWQLWTC-NH2 (SEQ ID NO: 12) Fc-epsilon-RI peptides c 

10 SCTWVWQLLTCK-NH2 (SEQ ID NO: 13) d 

SCHFGPLTWVCK.NH2 (SEQIDN0:14) EMPl peptides e 

SCTWGPLTLTCK-NH2 (SEQ ID NO: 1 5) f 



Circular dichroism spectra show that in each case, the designed trp hairpin 
scaffold yields a more structured peptide (Fig. Sa-c). NMR data are consistent with 
15 increased hairpin structure in the peptides, demonstrating that the scaffold can bias a 
variety of "difficult" turns toward structured states. 

Other conmion tums that can be presented on the hairpin scaffold include 
gamma-turns (3 amino acids), bulged tums (5 or 6 amino acids), and longer hairpins 
(8 amino acids). Other turn lengths are known and are also compatible with the 
20 scaffold. 

The results in Example 1 and 2 demonstrate that optimization of a single 
strand position in a small disulfide-constrained hairpin is sufficient to convert a very 
poorly structured molecule to one that is highly structured (-AAG > 0.8 kcal / mol). 
The stem portion of the structured haiipin, -CTW — LTC-, does not require an 
25 optimized turn sequence; thus, it is a suitable scaffold for display of P-tum libraries 
and for studying particular tums that might not otherwise be highly populated. 
Importantly, only natural amino acids are required, so turn libraries may be 
displayed on phage. 
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It is interesting to compare the substitution energies we report here with 
previous studies on P-sheet systems. Although the magnitude of the energy 
differences is similar, the rank order we obtain does not correlate with experimental 
P-propensity scales or with observed residue pair frequencies in known P-sheets 
5 (Hutchinson et al (1998) Protein Sci, 7:2287-2300, Wouters, M. A. & Curmi, P. M. 
G. (1995) Proteins 22:119-131). In particular, tryptophan is unexceptional in such 
scales. These differences stress that average trends in typical protein domains may 
not apply directly to small peptides in which most residues are highly solvent 
exposed, complicating the use of such information in de novo design. Furthermore, 
10 A AG rank order does not correlate well with increasing non-polar surface area of the 
side chains, although the preferred residues are hydrophobic. 

Finally, the hairpin stem is very small, yet the combination of disulfide and 
cross-strand tertiary contact imparts a structural bias exceeding that of a disulfide 
alone, e. g. CX4C. Although it is known that some particular sequences (e. g., 

15 WW) (Milbum et al, (1988) Int, J. Peptide Protein Res. 31:311-321.) cannot 
adopt turn conformations compatible with our hairpin, it is also true that very few of 
the turn sequences observed in proteins have been shown to adopt well defined turn 
conformations in isolated peptides. We have demonstrated a simple strategy to 
increase this number. We envision that hairpin libraries with randomized turn 

20 sequences (e. g., XCTWX4LTCX) might yield structured ligands whose binding 
determinants could be transferred readily to small synthetic turn mimetics or even 
used directly to identify small-molecule leads for high-throughput af&nity 
optimization (Rohrer et al. (1998) Science 282:737-740) . 

Example 3: Quantification of the Relative Contributions firom Turns and Cross- 
25 Strand Interactions 

In Example 1 above, substitutions were introduced into position 3 (X) of the 
model peptide bbp (peptide 1). This guest site is quite close in space to the type 11' 
turn (gly-asn. Fig. 1). To further investigate whether hairpins with different turn 
sequences and geometries would have different residue preferences at the NHB 
30 guest site, the central gly-asn sequence in model peptide 1 is replaced with the type 
r turn asn-gly (peptide 2) and the type 11' turns D-pro-asn and D-pro-gly (peptides 3 
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and 4). Substitutions at position 3 (X) were chosen to span the range of hairpin 
stabilities we observed in the gly-asn series. Cefr was measured as previously 
described in Example 1. The values we obtained for the different turns are 
compared in Figure 6. 

CTXEGNKLTC 1 ir SEQIDN0:16 

CTXENGKLTC 2 V SEQ1DN0:17 

CTXEpNKLTC 3 IF SEQIDN0:18 

CTXEpGKLTC 4 II' SEQIDN0:19 

5 X=W, Y, L, V, T. or D; p=D.pro. 

In all cases, tryptophan at position 3 yields the largest Ccfr value for a given 
turn, demonstrating that its stabilizing influence is general. The large changes in Ccfr 
for the different cross-strand interactions (horizontal axis) and turn sequences 
(vertical axis) show that both can contribute significantly to stability in these cyclic 
10 hairpin peptides. Finally, there are striking linear correlations between data sets, 
indicating that substitutions at strand position 3 and the turn replacements make 
independent contributions to stability of the cyclic hairpin. These data suggest that 
the hairpin fold may be quite modular, which would significantly simplify hairpin 
design. 

15 Relative turn energies can be calculated by comparing Cefr for the 

appropriate pairs of peptides. However, the correlation in Figure 6 allow the 
calculation of relative turn energies fi-om the slopes, which should be less sensitive 
to experimental error. These values are listed in Table 3. Compared to asn-gly (type 
I ), gly-asn (type 11) is less stablizing, while the D-pro-containing turns (also type 

20 IF) enhance hairpin stability. In the one case where a comparison may be made, 
asn-gly vs. D-pro-gly, the AAG value obtained here agrees reasonably well with that 
obtained by NMR. This suggests that the reference states assigned by Syud et ai 
(1999) J Am Chem Soc 121:11577 and their assumption of two-state folding are 
appropriate for their model system; however, defining such reference states is not 

25 always feasible. 
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Table 3. Turn Energies Relative to Asn-Gly in peptides 1-4 



mm sequence 


CefT correlation 
slope 




AAG. kcal mol * 


asn-gly repeat * 


1.19 


0.99 


-0.10 


gly-asn 


0.29 


0.97 


0.72 


D-pro-asn 


2.72 


0.98 


-0.58 


D-pro-gly 


3.01 


0.99 


-0.64 



independent sets of measurements were made for the asn-gly peptides in order to assess the reliability 
of the assay over time. AAG for the two data sets (- 100 cat mol*^) may be taken as an estimate of 
5 the error in the turn energies reported here. ' AAG may be compared to the value of -0.52 ± 0.1 1 
kcal mor ' (277 K) recently reported by Syud et al (1999) supra. 

Alternatively, substitution energies for the strand position may be obtained 
by plotting the same data, grouped instead by the residue X (not shown). The 
correlations are again excellent, and the slopes yield the free energy changes (Table 

10 4). The range of energies is larger than that reported in Example 1 for peptide 1 
(1 .42 vs, 0.85 kcal mol"*). Much of the difference is traced to those substitutions at 
the bottom of the stability scale (particularly asp). The less stable of the gly-asn turn 
peptides are not detectably structured, and Ccir assays do not register any difference 
between them. Thus, the data obtained in peptides with the stronger turn sequences 

1 5 provide a more complete view of the strand substitution energies. 

Table 4. Relative Energetic Contributions from Strand Residue X 

residue X slope" AAG, kcal mol"* AAG, GN tum*^ 

trp 2.92 -0.62 -0.53 

tyr 1.27 -0.14 -0.08 

val 0.66 0.24 0.09 

thr 0.45 0.46 0.30 

asp 0.25 0.80 0.32 

" Cerr values were plotted against those of leucine peptides 1-4. * AAG = -RT ln(slope), T = 293 K. 

AAG for the gly-asn turn series (AAG = -RT In (Cefr. x/Cefr. ku) ), as described in Example 1 . 

In order to assess how the turn types affect the Hairpin structure, the 
20 tryptophan analogs of 2 and 3 were characterized by NMR spectroscopy, and 
structures were calculated as described in Example I for peptide 1. The comparison 
of minimized mean structures in Figure 7 reveals that the backbone and side chain 
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conformations are very similar for the non-turn residues (RMSD - 0.3 A) regardless 
of the type of turn present. Thus, consistent with the linear correlations in Cerr (Fig. 
6), these three turns do not exert any structural influence on the adjacent strands. 

The importance of the turn sequence and good cross-strand pairing to hairpin 
5 structure has been addressed in many model studies. However, there is little 
quantitative data or systematic evaluation of residue substitutions. Our data show 
that, for these simple cyclic peptides, substitutions in a strand site and in the turn 
conform to simple linear free-energy relationships and have independent and 
additive effects on hairpin stability. This is unexpected, given their proximity in the 
10 structure (Figs. 3 and 7) and the reported sensitivity of calculated turn energies to 
features of the protein anchorage Mattos et al. (1994) J, Mol Biol. 238:733. 
Nonetheless, it would appear that coupling between these turns and the strands is 
negliglible compared to the large influence each exerts alone. This suggests that p- 
hairpin stability may be understood by separate analysis of these components. 

15 Example 4: Quantification of Energetic Contributions from Cross-Strand 
Residues 

The results of above Examples revealed tryptophan to be quite stabilizing in 
the non hydrogen-bonded (NHB) strand site X of peptide 1, when paired with a 
cross-strand leucine. The tryptophan peptide (bhpW) was highly structured in water, 

20 adopting the intended hairpin confomiation (Fig. 3). Here we investigate the 
relationship between the NHB cross-strand residues. Remarkably, we find that 
residue preferences for the two structurally inequivalent sites are the same, and that 
specific pair interactions produce only minor deviations from the single site 
contributions. Accordingly, a tryptophan-tryptophan cross-strand pair appears to be 

25 optimal for hairpin stability. 

Our observation of a stabilizing contribution from tryptophan prompted us to 
question how general the effect might be. The tryptophan in peptide bhpW (Fig. 3) 
is spatially near both the cross-strand leucine and the side chains of residues in the 
type ir turn. Therefore, it seemed possible that the effect of tryptophan might 
30 depend on stabilizing contacts with these other residues. In order to address this 
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question, we reversed the hydrophobic pairs (peptide 5), varying the amino acid at 
position 8 (nearest the disulfide, Fig. 3) with leucine fixed at position 3. Effective 
concentrations (Ccfr) of the cysteine thiols were determined as in our previous 
studies. 

CTXEGNKLTC 1 SEQIDNO:20 

CTLEGNKXTC 5 SEQ ID N0:2 1 

CTXEGNKWTC 6 SEQ ID NO:22 

CTWEGNKXTC 7 SEQ ID NO:23 

5 X=W, Y,F,L,M,I, VorA. 

We fmd that tryptophan at position 8 is the most stabilizing of those residues 
tested (Fig. 8A). Significantly, the Cen* values are quite close for the tip-leu and leu- 
tip pairs, indicating that the two arrangements are about equivalent energetically* 
This result appears to hold for other residue pairs with leucine: the rank order and 
10 numeric values of Cerr are similar, but not exact, in the two series (Fig. 8 A). 

To test whether the equivalence of reversed hydrophobic pairs might be more 
general, we examined peptide series 6 and 7, in which residues are instead paired 
with a cross-strand tryptophan (Fig. 8B). As with leucine pairs, a close 
correspondence is seen between the two tryptophan series, both in rank order and 
15 value of Cefr- We conclude that the two cross-strand sites are essentially equivalent, 
despite the differences in side chain position relative to the turn and disulfide. 

The two leucine series (1 and 5) may be compared to the tryptophan series (6 
and 7). The trends in the two data sets are remarkably similar (Fig. 8A and B), 
suggesting that the cross-strand residues contribute to stability in an independent 
20 manner. To explore this idea, we calculated free energy differences for substitutions 
in each of the peptide series relative to a reference peptide in that series (AAG = - 
RT In (Ccff^x^^cfT.rcf))- Representative comparisons are plotted in Figure 9. 

Linear free energy relationships exist among the four data sets. This is seen 
in comparisons of particular cross-strand pairs switched between NHB sites 3 and 8, 
25 and also for comparisons of trp pairs with leu pairs in the same orientation (Fig. 3). 
(There is more scatter in the latter correlations.) The slopes {p) were determined 



-53- 



wo 2004/035735 



PCTAJS2003/032450 



using AAG values scaled to two different reference peptides in each series = ala 
and trp). The pvalues obtained were not greatly different (Table 5). 

Table 5. Slopes (p) of Hammett plots for peptide series 1, 5-7 

X-axis data set X3 vs. X8 leu vs. trp 

1.15(1.11)* 0.47(0.43)* 
L3X8 0.98(0.86)* 

X3W8 0.43 (032/ 

Plots vs. W3X8 data are shown in Figure 9. Values in parentheses were obtained using the 
5 tryptophan peptide in each series as internal reference (instead of the alanine peptide). 

Consistent with the idea that positions 3 and 8 are equivalent, p is near 1 for 
plots comparing these data. In contrast, when leu pairs are compared to trp pairs, p 
is about 0.4. This means that for a given pair of residues X, the expected difference 
in hairpin stability is -2.5-fold larger with trp as the cross-strand partner. Given 
10 these simple relationships, AAG could be calculated for any substitution relative to a 
reference pair by multiplying a substituent energy ax by p for the cross-strand 
partner (see below). This is surprising, as these residues are within contact distance, 
and it has important implications for P-hairpin design. 

Statistical analyses of HB and NHB cross-strand pairs in p-sheet proteins 
15 find many residue pairs to be positively or negatively correlated with high 
confidence. Largely in accord with the statistical preferences, protein mutagenesis 
studies have identified interaction energies as large as 1 kcal mol'^ between HB 
pairs. It has been proposed that including cross-strand pairs preferred in proteins 
might improve stability or fix strand register in isolated p-hairpins. 

20 To look for these effects, we calculated pair interaction energies (Fig. 10). 

CefT ratios yield AAG for the single or double substitutions. Typically, the difference 
between AAG for the double substitution and lAAG for the single substitutions is 
taken as an interaction energy. In the example shown, this would be -136 cal mof* 
for the trp-tyr pair relative to a leu-leu reference state. If the the single substitution 

25 energies are calculated sequentially, scaling by p in the second step, the discrepancy 
is only +41 cal mof' (insignificant in these experiments). For the phe-trp pair 
(compare Figure 2, top and bottom), a similar analysis yields AAAG = -253 cal mof^ 
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when p is included. This discrepancy is significant, and it suggests that there might 
be some specific structural advantage in pairing phe with trp (beyond the general 
superiority of trp). It is interesting that the discrepancy is small when compared to 
the total range of energies seen for single site substitutions (Fig. 9); we believe that 
5 such pair-specific effects (and experimental error) may explain the scatter in our 
correlations. 

Our experiments show clearly that cross-strand tertiary contacts enhance 
hairpin stability. Notably, introduction of the trp-trp pair results in a large 
stabilization compared to our original peptide bhpW (Figs. 3 and 8), and we believe 

10 that, despite its rarity in proteins, trp-trp is the optimal NHB pair for isolated 
hairpins. In most cases, the pair interaction energies we obtain through conventional 
double mutant analysis are adequately explained by differences in p. That is, these 
energies are not specific to a single pair, but instead reflect greater or lesser 
sensitivity to all residue substitutions opposite a given cross-strand partner. 

15 Therefore, we conclude that the combined single site preferences (a and p) are most 
important in predicting hairpin stability. Significantly, it should be possible to make 
these predictions from a limited basis set of experimental data. 

Example 5: Construction of phape-displaved libraries based on the trp peptide 
scaffold 

20 Libraries of random peptides fused to the gene 8 protein of the filamentous 

bacteriophage Ml 3 were produced by Kunkel mutagenesis of plasmid pS 1302b, a 
derivative of pS349 (U.S. patent application Nos. 60/103,514 and 60/134,870, 
incorporated herein by reference). Plasmid pS 1302b includes the tac promoter and 
malE leader sequence of pS349. The hGH sequence and Gly/Ser-rich linker 

25 sequence of pS349 were replaced by the sequence: 

S^TAA-TAA-TAA-ATG-GCT-GAT-CCG-AAC-CGT-TTC-CGC-GGT-AAA-GAT-CTG- 
GGT-GGC-GGT-ACT-CCA-AAC-GAC-CCG-CCA.ACC-ACT-CCA-CCA-ACT-GAT- 
AGC-CCA-GGC-GGTO' (SEQ ID NO:24) 

The inserted sequence encodes three stop codons, the GD epitope tag, and a 
30 linker selected for high-level display of hGH. The plasmid also includes the lac 
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repressor (lacM) and the ampicillin resistance gene from pS349. The 
oligonucleotide used to construct the library was: 

5*«TCC-GCC.TCG-GCT-TAT-GCA->nsIS-TGC-ACT.TGG->mS-hW 
ACT-TGT.NNS-ATG.GCT-GAT-CCG-AAC-CGT-3' (SEQ ID NO:25) 

5 The form ofthe random peptides was therefore XCTWX4LTCX, A library 

9 10 

of 10 to 10 individual transformants was prepared by previously described 
methods (U.S. patent application Nos. 60/103,514 and 60/134,870). Approximately 
one-third of individual clones encoded a functional peptide sequence. The 
remainder were starting template, contained stop codons, or contained single 
10 nucleotide deletions. The library size is thus adequate to include several copies of 
each possible random sequence. 

Example 6: Selection of binding peptides from the structurallv-biased librarv 

Nunc MaxiSorp plates were coated overnight with 2 \xg/mL rhuFc-epsilon- 
RI-IgG fusion in PBS. Plates were then blocked for one hour at room temperature 

15 with 0.5% BSA (Sigma A-7638) in PBS. Negative wells were prepared by coating 
only with 0.5% BSA. Phage (lo'* ifii per well) were added to ten each positive and 
negative wells and incubated with shaking for 20 h at room temperature. After 
extensive washing to remove nonspecifically-bound phage, binders were eluted by 
treatment with 0.2 M glycine, pH 2 for five minutes. The eluted phage were then 

20 neutralized by addition of TRIS base and used to infect a culture of E. coli (XLl- 
blue, Stratagene). Several cycles of binding, elution, and amplification (3-5 total) 
were conducted under similar conditions. 192 individual clones were screened for 
binding to the target receptor by incubation of phage supernatant with plates 
prepared as described for phage sorting. After washing, wells were treated with 

25 alpha-M13-HRP conjugate (Pharmacia Biotech 27-9421-01), and bound antibody 
was detected with OPD substrate (Sigma P-9187). Plate absorbance (A492-A405) 
was compared between positive and negative plates to identify those clones positive 
for receptor binding. Twelve such clones were identified. 

The sequence of positive clones is identified by sequencing the encoding 
30 DNA. Peptides corresponding to the displayed sequences (i.e., 12-mers) are 
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synthesized using standard solid-phase methods. The peptides are then assayed 
using an appropriate biological or binding assay to determine their potency. 
Peptides can be evaluated for hairpin structure using any of the known techniques 
outlined above: circular dichroism, NMR, or disulfide equilibrium. Substitutions 
5 may then be made in the peptides to determine the relative contributions of the 
selected turn residues to binding. Ideally, these substitutions will not disrupt the 
scaffold structure. Once the nature of the binding motif is understood, the turn 
sequence can then be transferred onto a suitable organic scaffold for further 
optimization. 

10 Example 7. Stability of the Decamer bhp scaffold with Specific HB Residues 

The present example investigates substitutions at the hydrogen-bonded (HB) 
strand positions 2 (i.e., Al) and 9 (i.e., A5) of the bhp scaffold. The results show 
that the stability determinants at these sites are different than those at the non- 
hydrogen-bonded (NHB) positions 3 (i.e., A2) and 8 (i.e., A4). In marked contrast 

15 to the observed equivalence of positions 3 and 8, positions 2 and 9 have very 
different residue preferences fi:^om each other. In particular, a very large 
stabilization to the cyclic hairpin fold from a valine residue at position 9; a much 
smaller stabilization occurs in Val2 variants. This effect is evident fi'om Cerrderived 
free energy differences and from NMR structural analysis of Val9 and Val2 bhp 

20 analogues. Further evidences suggest that the strong stabilizing effect of Val9 in the 
bhp peptides is a consequence of its proximity to the cross-strand disulfide. 
Intriguingly, Val9 is often present in disulfide-cyclized P-hairpins selected from 
phage-displayed peptide libraries, suggesting that its stabilizing effect is general for 
this fold. 

25 In order to evaluate the residue preferences at hydrogen-bonded strand 

positions of p-hairpins, the previously described cyclic peptide bhpW was used as a 
host. This peptide adopts a well-defined hairpin conformation, and its NMR 
structure is illustrated in Figure 3. The disulfide of bhpW helps to stabilize the 
hairpin conformation and simultaneously provides an assay for structure. Changes 

30 in hairpin stability can be quantified by changes in the position of the thiol-disulfide 
equilibrium. Residues 2 and 9 of bhpW are cross-strand from one another and form 
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backbone hydrogen bonds. Several substitutions were then introduced at positions 2 
and 9 in bhpW: 



bhpW 
X2 series 
X9 series 
KGNE turn 
V2H9 pair(«VH) 
H2V9 pair ("HV") 
HV, X3 series 



CTWEGNKLTC 
CXWEGNKLTC 
CTWEGNKLXC 
CTWKGNELTC 
CVWEGNKLHC 
CHWEGNKLVC 
CHXEGNKLVC 



SEQ ID N0:2 

SEQIDNO:26 

SEQIDNO:27 

SEQIDNO:28 

SEQ IDNO:29 

SEQIDNO:30 

SEQIDN0:31 



The relative stabilities of the bhpW variants were determined from the 
5 effective concentrations (Ccfr) of the cysteine thiols (see the experimental section of 
Example 1) and are shown in Figure 11. Substitutions at both positions have large 
effects on hairpin stability. The range of stabilities is 1 kcal mol"^ for substitutions 
at position 2 (Fig. 1 1 A) and 0.7 kcal moF* for the same substitutions at position 9 
(Fig. 1 IB). The rank order of residue preferences at each of these positions is rather 

10 different than what were observed previously for the NHB positions 3 and 8 (see 
Example 4). Significantly, the pattern of stability changes for position 2 is 
completely different from that of position 9. In contrast, residue preferences are the 
same for positions 3 and 8. Therefore, in addition to the influence of a cross-strand 
hydrogen bond between residues 2 and 9, the bhp hairpins exhibit a localized 

15 asymmetry that analysis of residue preferences in these sites. 

It is possible that the asymmetry in residue preferences at positions 2 and 9 
reflects differences in contacts with nearby side chains. Peptides were prepared in 
which a valine residue was paired with a cross-strand histidine instead of a 
threonine. We also swapped residues 4 and 7, whose side chains are on the same 
20 face of the hairpin as residues 2 and 9. In another hairpin model system, "diagonal" 
interactions between side chains of NHB residues, placed in proximity by the 
interstrand twist as are HB residues 4 and 9 here, have been proposed to influence 
hairpin stability. Syud et al. (2001)/ Am. Chem. Soa 123:8667-8677. In both 
cases, the strong stabilizing effect of valine at position 9 remains (Figure 12). It 
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would seem therefore, that some structural feature other than particular side chain- 
side chain contacts is responsible for this effect. 

Tryptophan residues at NHB strand positions have been associated with a 
distinct twist in p-hairpin structures; it is possible that this creates a local 
S conformational distortion, influencing residue preferences at adjacent positions. To 
investigate this, we fixed residues 2 and 9 in the more stabilizing "HV" combination, 
then varied position 3. The relative stabilities of these peptides are compared to 
those of the analogous T2T9 peptides in Figure 13. The HV peptides are uniformly 
more stable than their TT counterparts, as indicated by their higher Ceir values. 

10 Significantly, the substitution firee energy differences within each series are linearly 
correlated. This demonstrates that the large stabilization imparted by the valine 9 
substitution does not depend on the presence of a particular residue pair at the 
neighboring NHB sites. Interestingly, the slope of the plot is not 1, but instead 
indicates that the HV host is more sensitive to substitutions at position 3 by a factor 

15 of 2. Taken together, the data in Figures 11-13 suggest that the strong, asymmetric 
stabilization by valine at position 9 is a general property of CXgC disulfide- 
constrained hairpins. 

In order to assess the effect that the substitutions at the HB site had on 
hairpin conformation, structures were determined for HV and VH on the basis of ^H 

20 NMR data. The NMR data for HV are in full agreement with this being a highly 
structured p -hairpin: the ^H NMR secondary chemical shifts (the differences 
between observed and random coil shifts) are much larger than for bhpW, and Vna- 
Hp coupling constant values indicate defined side chain rotamers for three pairs of 
strand residues (Cysl-CyslO, His2-Val9, Trp3-Leu8; Table 6). In particular, His2 

25 and Val9 can be unambiguously assigned to adopt xi rotamers of +60° and 1 80°, 
respectively. For peptide VH, the restraints generated fi^om the NMR data clearly 
define a conformation, but the H* secondary chemical shifts are less extreme than 
for HV. Furthermore, Vjia-Hp coupling constants indicate motional averaging for 
the Val2, His9, Trp3 and LeuS side chains (Table 6). All of these features indicate 

30 reduced stability for VH compared to HV, consistent with the Ceft measurements. 
The relative structural stability of HV and VH does not depend on the presence of 
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histidine; very similar NMR parameters are also observed for peptides in which the 
histidine is replaced by threonine (Table 6). That is, TV has well-defined side chain 
conformations, whereas VT exhibits conformational averaging at the side chain 
level. 

5 Table 6. ^Th^ »Hp for Hydrogen Bonded Cross-Strand Residues 



peptide position 2 position 9 Ccfr, mM 

bhpW (TT) T9 nA? 6^8 rud, 210 

HV 3.8,3.8'^ +60' 9.8 180' 1181 

VH 5.6 n.d. 7.4,7.6 n.d. 293 

TV 4.2 +60' 9.7 180* 677 

VT 5.9 n.d. 6.5 n.d. 305 



^ Values representing non-averaging X\ torsion angles are indicated by bold typeface. ^ not defined. 
not applicable. 

The structures determined for HV and VH are quite similar at the backbone 
level (RMS difference of 0.37 A between the mean structures): both peptides adopt 

10 a type IV turn and have a considerable right-handed twist of the strands, as was seen 
previously for bhpW. Calculation of the interstrand twist angle indicates that for the 
cyclized hairpins, the twist is not uniform, with less twisting adjacent to the disulfide 
(0 < 20° between the Cysl, CyslO pair and the 2, 9 pair) and more twisting before 
and after the NHB pair Trp3, Leu8 (0 30*^). Since these measurements involve 

15 atoms from both strands, the degree of twist present in each individual strand is not 
apparent from 0. Alternatively, the sum of \|/ from one residue and ^ from the 
succeeding residue (v|// + gives an indication of the twist between consecutive 
a-carbons in the same strand, with non-twisted strands having a value of zero and 
strands of right-handed twist having a positive value. The twist data for the HV and 

20 VH ensembles are compared to bhpW in Figure 14. These data clearly indicate that 
the majority of the twist contributing to the large interstrand 0 values occurs on the 
N-terminal side of the NHB hydrophobic residues (45-65*^ between HisA^al/Thr2 
and Trp3, and between Lys7 and Leu8, compared to < 20^ between Trp3 and Glu4 
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and between Leu8 and Val/His/Thi9). Although the ranges of + within the 
three ensembles overlap, the data in Figure IS indicate that compared to VH, HV 
has a slightly more pronounced twist preceding the NHB sites and a less pronounced 
twist around Cysl (assuming a normal distribution of dihedral angles within each 
5 ensemble, an independent samples t-test indicates that the mean v|/,- + 1 for HV and 
VH are different for residues 1-2, 2-3, and 7-8 with a p-value of less than 0.01). 
This suggests that swapping the location of His and Val side chains does have a 
subtle influence on backbone geometry. 

The main difference between the HV and VH ensembles is the degree of side 
10 chain order. Although Cysl, Trp3, Leu8 and CyslO adopt the same X\ in HV and 
VH (-60° for Cysl, Trp3 and CyslO; 180° for Leu8), the magnitudes of the Vho-hp 
are less extreme in the latter, suggesting that the side chains of VH are transiently 
sampling alternate conformations. More importantly, in HV the Xi rotamers of His2 
and Val9 are fixed (+60° and 180°, respectively) bringing these two side chains into 
IS van der Waals contact above the cross-strand hydrogen bonds. Due to the twisting 
of the strands, the y2 methyl group of Val9 is also brought into van der Waals 
contact with the side chain methylene groups of both Glu4 and Lys7. However, in 
VH, Val2 populates both the -60° and 180° rotamer wells, and most of the structures 
in the ensemble have a xt of -60 for His9, orienting the side chain in the direction of 
20 the turn. Moreover, the twisting of the strands directs the Val2 side chain towards 
the termini of the peptide. Thus, there is little or no contact between these side 
chains, and the backbone hydrogen bond between these residues is more exposed to 
solvent. 

In order to assess whether the asymmetric stabilization seen for the HV 
2S peptide was related to differences in interstand twist imposed by the disulfide bond, 
peptides HV and VH were examined by NMR after reduction. Surprisingly, reduced 
HV has observable residual structure: 4 VnN-Ha coupling constants exceed 8 Hz, 
Leu8 methyl chemical shifts are 0.41 ppm and 0.33 ppm upfield compared to 
random coil, and 3 backbone cross-strand NOEs are observed (Cysl tf* to CyslO 
30 H'^, His2 H^ to Val9 H^ and Glu4 H^ to Lys7 H^). In contrast, there are no 

indications that reduced VH has a preferred conformation. Presumably, Val9 can 
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adopt a similar conformation in the reduced peptide as it does in the oxidized fonn, 
while the strong right-handed twist of the strands preceding position 3 (see above) 
would not allow the Val2 side chain of either form to interact with the C-terminal 
strand. 

5 Taken together, the date shows that disulfide-cyclized P-hairpins can be 

markedly stabihzed by a valine residue immediately preceding the C-terminal 
cysteine, with isoleucine appearing to be nearly as stabilizing. This effect may be 
due to a favorable conformational relationship between Val9 and CyslO, For a 
stable p-tum phage display, the combination of a Tip residue at position 3 or 8 and 

10 the very stabilizing His-Val 2,9-pair reported here would appear especially useful. 
Compared to the least stable combinations in which definite hairpin structure is 
detectable, these substitutions stabilize the fold by more than 1.5 kcal moVK 
Inclusion of the optimal W3-W8 pair should increase stability further, expanding the 
range of P-tums that might be structured. The data further suggests that libraries 

15 with fixed CXnWVC motifs might also be useful in generating structured hairpin 
ligands. 

Example 8. Structure and Activity of the bhpBR3 peptide 

This example describes use of the bhp scaffold to generate a turn peptide that 
mimics binding of the BR3 receptor to its ligand, BAFF. BAFF (also known as 

20 BLyS, TALL-1, zTNF4, THANK and TNFS 13B), a recently defined member of the 
TNF family, is a homotrimeric type 2 transmembrane protein expressed by 
macrophages, monocytes and dendritic cells. Moore et al. (1999) Science 285:260- 
263; Schneider et al. (1999) J, Exp. Med 189:1747-1756. BAFF has been found 
critical for the development and survival of peripheral B cells. Gross et al. (2001) 

25 Immunity 15:2&9'302; Schiemann et al, (2001) Science 293:21 1 1-2114. 

Intriguingly, humans suffering from autoimmune syndromes, including systemic 
lupus erythematosus (SLE), rheumatoid arthritis, and Sjogren's syndrome where end 
organ damage is primarily in the kidneys, joints and salivary/lacrymal glands, 
respectively, have elevated levels of serum BAFF. Furthermore, BAFF levels 

30 correlate with disease severity, consistent with a possible role in the pathogenesis of 
these disabling maladies. Cheema et al. (2001) Arthritis Rheum. 44:1313-1319; 
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Groom et al. (2002) 7. Clin, Invest 109:59-68; Zhang et al. (2001) Immunol 
166:6-10. 

Of the three receptors for BAFF, only BR3 is specific; the other two, TACI 
and BCMA, also bind the related Hgand APRIL. Gross et al. (2000) Nature 
5 404:995-999; Thompson et al. (2001) Science 293:2108-21 1 1; Yan et al. (2001a) 
Curr. Biol. 11:1 547-1552; Yan et al. (2000) Nat. Immunol 1 :37-41 . The 
extracellular domain of TACI has a characteristic TNFR-like structure 
encompassing two cysteine-rich domains (CRDs) that are the hallmark of the TNF 
receptor family. These approximately 40 residue pseudorepeats have a unique 

10 secondary structure, typically characterized by three intrachain disulfides involving 
six highly conserved cysteines. BCMA is unusual in that it contains only a single 
canonical CRD. However, BR3 is even more divergent in that its extracellular 
domain is composed of only a partial CRD, containing four cysteine residues with 
spacing distinct from other TNFR modules characterized previously. Bodmer et al. 

15 (2002) Trends Biochem. Sci. 27:1 9-26; Naismith and Sprang (1 998) Trends 
Biochem. Sci 23:74-79. Conventional members of the TNFR family utilize two 
CRDs for binding ligand; contacts stem primarily from analogous loops from each 
CRD interacting with two distinct surface patches on the ligand (reviewed in 
Bodmer et al., 2002). Thus, how high-affinity binding to BAFF is achieved by only 

20 a single, or partial, CRD like that of BR3 is unclear. Recent crystal structures of 
BAFF reveal a trimeric TNF-like fold with several distinguishing features (Karpusas 
et al. (2002) / Mol Biol 315:1 145-1 154; Liu et al. (2002) Cell 108:383-394; Oren 
et al. (2002) Nat Struct Biol 9:288-292), however no structure of any of the BAFF 
receptors has been described. 

25 Methods and Materials 

Peptide synthesis 

MiniBR3 and bhpBR3 were synthesized as C-terminal amides on a Pioneer 
peptide synthesizer (PE Biosystems) using standard Fmoc chemistry. Their 
sequences are as follows (also shown in Figure 15 A): 

30 
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Mini BR3: TPCVPAECFDLLVRHCVACGLLRTPR (SEQ H) NO:32) 
bhpBR3: CHW DLLVRHW VC (SEQ ID NO:33) 

For minlBRS, the side chain thiols of cysteines 19 and 32 were protected as 
the trifluoroacetic acid (TFA)-stable acetamidomethyl (Acm) derivatives. Peptides 
5 were cleaved from resin by treatment with 5% triisopropyl silane in TFA for 1 .5-4 
hr at room temperature. After removal of TFA by rotary evaporation, peptides were 
precipitated by addition of ethyl ether, then purified by reversed-phase HPLC 
(acetonitrile/H2O/0.1% TFA). Peptide identity was confinned by electrospray mass 
spectrometry. BhpBR3 was converted to the cyclic disulfide by dropwise addition 

10 of a saturated solution ofh (in acetic acid) to HPLC fractions. After lyophilization, 
the oxidized peptide was purified by HPLC. HPLC fractions containing reduced 
miniBR3 were adjusted to a pH of - 9 with NH4OH; the disulfide between cysteines 
24 and 35 was then formed by addition of a small excess of K3Fe(CN)6> and the 
oxidized peptide purified by HPLC. Acm groups were removed (with concomitant 

15 fomiation of the second disulfide) by treatment of the HPLC eluate with a small 
excess ofh over 4 h. The progress of the oxidation was monitored by analytical 
HPLC, and the final product was again purified by HPLC. MiniBR3 was amino- 
terminally biotinylated while on resin, then cleaved and purified exactly as described 
above for the unmodified peptide. 

20 NMR spectroscopy 

Two-dimensional (2D) NMR experiments were acquired and analyzed as 
described in Starovasnik et al., (1996) Biochemistry 35:15558-155569, using a 
Bruker DRX-600 spectrometer at 293K on a sample containing 2.9 mM bhpBR3, 
pH 4.5, with 0.1 mM DSS as a chemical shift reference. Distance restraints were 
25 derived from 2D NOESY spectra (t^ 250 ms); dihedral angle restraints were derived 
from analysis of a 2D DQF-COSY spectrum acquired in 92%H20/8%D20 and a 2D 
COSY-35 spectrum acquired on a sample dissolved in 100% D2O. Complete 
resonance assignments and coupling constant values are included in the 
supplemental data. 
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The three-dimensional structure of bhpBR3 was calculated based on 78 
NOE-derived (including 26 long-range) distance restraints and 17 dihedral angle 
restraints. 100 initial structures were calculated using DGII; 80 of these were further 
refined by restrained molecular dynamics using DISCOVER as described 
5 (Starovasnik et al., (1996) supra). Twenty structures having the lowest restraint 
violation energy and good geometry represent the solution conformation of bhpBR3. 
The model with the lowest nns deviation (RMSD) to the average coordinates of the 
ensemble was chosen as the representative structure (model 1 in the PDB file). The 
final ensemble of twenty models satisfies the input data well, having no distance or 

10 dihedral angle restraint violations greater than 0.1 A or T, respectively. The 
structures are well defined, with an average backbone RMSD to the mean 
coordinates of 0.42 ± 0.07 A, and have good covalent geometry as judged by 
PROCHECK (87% of the residues in the most favored, 9% in the allowed, and 4% 
in the generously allowed regions of (|)\{/ space). Laskowski et al.» (1993) J, AppL 

15 Crystallogr, 26:283-291 . The structure of bhpBRS will be available ftom the RCSB 
Protein Data Bank (ID code xxxx). 

Competitive displacement ELISA 

Nunc Maxisoip 96-welI plates were coated overnight at 4 °C with 100 \x\ of a 
2 fig/ml solution of BAFF in carbonate buffer, pH 9.6. The plate was washed with 

20 PBS and blocked with 1% skim milk in PBS. Serial dilutions of BR3 variants were 
prepared in PBS/0.05% Tween 20 containing 3 ng/ml biotinylated miniBR3. After 
washing with PBS/Tween, 100 jil/well of each dilution was transferred and 
incubated for 1 h at room temperature. The plate was washed with PBS/Tween and 
incubated 15 min with 100 ^1/well of 0.1 U/ml Streptavidin-POD (Boehringer 

25 Mannheim) in PBS/Tween. After washing with PBS/Tween followed by PBS, the 
plate was incubated S min with 100 |il PBS substrate solution containing 0.8 mg/ml 
OPD (Sigma) and 0.01% H2O2. The reaction was quenched with 100 ^l/well IM 
H3PO4 and the plate read at 490 nm. IC50 values were determined by a four- 
parameter fit of the competitive displacement ELISA signal. The concentrations of 

30 initial stock solutions of bhpBR3 were determined spectrophotometrically as 

described (Gill and von Hippel, (1989) Anal Biochem, 182:319-326), while those of 
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miniBR3 and BR3 extracellular domain were determined by quantitative amino acid 
analysis. 

The turn peptide bhpBR3 competitively binds to BAFF 

As a first step towards developing agents to disrupt the BAFF-BR3 
5 interaction, an NMR analysis of the extracellular ligand-binding domain of BR3 was 
performed. Intriguingly, only the central one-third of the protein adopts a stable 
structure in solution; this core is stabilized by two disulfide bonds connecting 
Cysl9/Cys32 and Cys24/Cys35. Consequently, a 26-residue miniBR3 peptide was 
synthesized, incorporating the 1 :3, 2:4 disulfide-bonding pattern, and characterized 
10 structurally (Figure 1 5A). Indeed, NMR spectra of miniBR3 indicated that this 
peptide adopts essentially the same structure as in the context of the full-length 
protein. Importantly, miniBR3 also binds with the same affinity as full-length BR3 
to BAFF (-70 nM IC50; Figure 16A). 

Given that high-affmity BAFF-binding was contained within a 26-residue 
1 5 core, attempts were made to further delineate the BAFF-binding portion of BR3. In 
crystal structures of other TNF-like ligand/receptor complexes, a receptor loop 
analogous to BR3 residues ^^DLLVRH^* is involved in forming direct contacts with 
the ligand. Bodmer et al. (2002) Trends Biochem, Sci 27:19-26. In the context of 
both full-length and miniBR3, NMR analysis suggested that the sequence 
20 ^^DLLVRH^* presents a type I p-tum with backbone hydrogen-bonding between 
Asp26 and His31. Thus, a 12-residue peptide was synthesized in which the six 
residues from BR3 were embedded within a disulfide-bonded p-hairpin (bhp) 
scaffold (Figure 15 A). As described previously, the strong strand-strand interactions 
in these scaffolds can structure a variety of P-tums. 

25 The peptide bhpBR3 adopts a remarkably stable conformation in solution as 

indicated by a high degree of chemical shift dispersion, extreme values for many of 
the backbone and side chain coupling constants, and a large number of long-range 
NOEs present in its NMR spectra. The three-dimensional structure of bhpBR3 
consists of a p-hairpin in which the BR3 tiim sequence adopts the type I p-tum 

30 structure, as expected, with Arg30 adopting a positive ^ angle and the side chains of 
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Asp26, Leu28, Val29, and His31 projecting on one face of the P-tum (Figures 15B 
and 15C). If BR3 binds BAFF using interactions homologous to those observed for 
TNFR and DR5 (Hymowitz et al., (1999) MoL Cell 4:563-571, then one would 
expect this face (the "bottom" face of the turn shown in Figure 15C) to contact 
5 BAFF. 

Because bhpBR3 mimics structurally the BR3 turn, its ability to compete 
with miniBR3 for binding to BAFF was tested in a competitive displacement ELISA 
(Figure 16 A). Remarkably, the 12-residue peptide blocked binding of the larger 
core domain (ICso =15 |iM), indicating that the critical binding determinants do 
10 indeed reside in the six-residue turn shown in Figure 1 5C. Further, the activity of 
bhpBR3 was confirmed in a bioassay: bhpBR3, but not a control hairpin peptide, 
blocked BAFF mediated NF-KB2/p52 induction in primary Bcells (Figure 16B). 

The finding that a |}-tum structure from BR3 has significant affinity for 
BAFF has implications for recognition of BAFF by its other receptors. TACI and 

15 BCMA share homologous sequences in this loop region that would be expected to 
adopt a similar turn conformation to that in BR3. Therefore, the interactions of this 
turn with ligand will likely be a conserved feature of all BAFF/receptor complexes. 
The surprising identification of such a focused recognition epitope will provide the 
framework for developing small-molecule peptidomimetic inhibitors of the BAFF- 

20 BR3 interaction; these inhibitors may have therapeutic potential in the treatment of 
autoimmune diseases such as lupus. 

While the invention has necessarily been described m conjunction with 
preferred embodiments, one of ordinary skill, after reading the foregoing 
25 specification, will be able to effect various changes, substitutions of equivalents, and 
alterations to the subject matter set forth herein, without departing fi'om the spirit 
and scope thereof Hence, the invention can be practiced in ways other than those 
specifically described herein. 
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Claims 

1 . A cyclic peptide scaffold for presenting a p-tum hairpin structure, said cyclic 
peptide scaffold comprises an amino acid sequence Cl-Al-A2-(A3)n-A4-A5-C2 
(SEQ ID N0:1), wherein 

CI and C2 are cysteines; 

Al, A2, A3, A4, and A5 are naturally occurring L-amino acids; 

the N'-terminus of the scaffold is optionally protected with an amino 
protecting group; 

the C'-terminus of the scaffold is optionally protected with a carboxy 
protecting group; 

Al and A5 are selected from the group consisting of amino acids W, Y, F, H, 
I, V, and T; 

A2 and A4 are selected from the group consisting of amino acids W, Y, F, L, 
M, I, and V; 

A3 is any naturally occurring L-amino acid and n is an integer that is 3, 4, S, 
6, 7, 8, 9, 10, 11, or 12; and 

CI and C2 together form a disulfide bond thereby forming a cyclic peptide. 

2. The cyclic peptide scaffold of claim 1 , wherein Al or A5 is a p-branched 
residue having two non-hydrogen substituents on the P-carbon of the amino acid 
residue. 

3. The cyclic peptide scaffold of claim 1, wherein Al or A5 is T. 

4. The cyclic peptide scaffold of claim 1, wherein Al or A5 is amino acid W, F, 
HorY. 

5. The cyclic peptide scaffold of claim 1, wherein Al is H and A5 is V. 
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6. The cyclic pq}tide scafTold of claim 1, wherein A2 or A4 is amino acid W, F 
orY. 

7. The cyclic peptide scaffold of claim 6, wherein A2 and A4 is W. 

8. The cyclic peptide scaffold of claim 7, wherein both A2 and A4 are W. 

9. The cyclic peptide scaffold of any of claims 1-8, wherein n is at least 4. 

10. The cyclic peptide scaffold of claim 9, wherein n is no greater than 10. 

1 1 . The cyclic peptide scaffold of claim 9, wherein n is 4. 

12. The cyclic peptide scaffold of any of claims 1-1 1, wherein (A3)4 is EGNK, 
ENGK, QGSF orVWQL. 

13. The cyclic peptide scaffold of claim 1, wherein Al is T or H, A2 is W or L, 
A4 is W or L and A5 is T or V. 

14. The cyclic peptide scaffold of claim 1, wherein Al is selected from the group 
consisting of H and V, and A5 is selected from the group consisting of V, H and L 

15. The cyclic peptide scaffold of any of claims 1-14, wherein (A3)n is 
DLLVRH. 

16. The cyclic peptide scaffold of claim IS, wherein the amino acid sequence is 
CHWDLLVRHWVC (SEQ ID NO:33). 

17. The cyclic peptide scaffold of claim 13, wherein the amino acid sequence is 
CTWEGNKLTC (SEQ ID N0:2), 

1 8. The cyclic peptide scaffold of claim 1 3, wherein the amino acid sequence is 
CHWEGNKLVC (SEQ ID NO:30). 
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FIG. lA 
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FIG. 2A 
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FIG. 4B 
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FIG.5A 
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FIG. 6 
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FIG. 7 
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FIG. 8A 
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FIG. 10 



o^s-iiscalmol 



•475 cal mol*'' 
L3L8 ►WSLB 

-729calmor^ 



-1 



L3Y8 



W3Y8 



SUBSTITUTE SHEET (RULE 26) 



wo 2004/035735 



12/17 



PCT/US2003/032450 
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FIG. 12 
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FIG. 13 
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FIG. 14 
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FIG. 16A 
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SEQUENCE LISTING 

<110> Genentech, Inc. 

<120> A STRUCTURED PEPTIDE SCAFFOLD FOR DISPLAYING TURN LIBRARIES 
ON PHAGE 

<130> 11669. 116W0I1 

<140> New Filing 

<141> 2003-10-14 

<150> US 10/271,343 

<151> 2002-10-15 

<150> US 09/592,695 

<151> 2000-06-13 

<150> US 60/139,017 

<:151> 1999-06-14 

<160> 60 

<170> Patentin version 3.1 

<210> 1 

<211> 18 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Turn Peptide 
<220> 

<221> MISC_FEATURE 

<222> (2).. (2) 

<223> Xaa is independently Trp, Tyr, Phe, His, lie, Val, or Thr 



<220> 

<221> MISC_FEATURE 

<222> (3) . . (3) 

<223> Xaa is independently Trp, Tyr, Phe, Leu, Met, lie, or Val 



<220> 

<221> MISC_FEATURE 
<222> (4).. (15) 

<223> Xaa can be any naturally occuring L-amino acids and 9 amino acids 
may be absent 



<220> 

<221> MISC_FEATURE 

<222> (16).. (16) 

<223> Xaa is independently Trp, Tyr, Phe, His, He, Val, or Thr 



<220> 

<221> MISC FEATURE 



1/19 



wo 2004/035735 



PCTAJS2003/032450 



<222> (17).. (17) 

<223> Xaa is independently Trp, Tyr, Phe, Leu, Met, lie, or Val 



<400> 1 

Cys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
15 10 15 



Xaa Cys 



<210> 2 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Turn Peptide 
<400> 2 

Cys Thr Trp Glu Gly Asn Lys Leu Thr Cys 
15 10 



<210> 


3 


<211> 


12 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


3 



Ser Cys Thr Trp Glu Gly Asn Lys Leu Thr Cys Lys 
15 10 



<210> 


4 


<211> 


10 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


4 


Cys Gly Asn Gin <51y Ser Phe 


1 


5 


<210> 


5 


<211> 


10 


<212> 


PRT 



10 



<213> Artificial Sequence 
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<220> 

<223> Turn Peptide 
<400> 5 

Cys Thr Trp Gin Gly Ser Phe Leu Thr Cys 
15 10 



<210> 


6 


<211> 


12 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


6 



Ser Cys Gly Asn Gin Gly Ser Phe Leu Thr Cys Lys 
15 10 



<210> 


7 


<211> 


12 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


7 



Ser Cys Thr Asn Gin Gly Ser Phe Leu Thr Cys Lys 
15 10 



<210> 


6 


<211> 


12 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


8 



Ser Cys Gly Trp Gin Gly Ser Phe Leu Thr Cys Lys 
15 10 



<210> 9 

<211> 12 

<212> PRT 

<213> Artificial Sec[uence 
<220> 

<223> Turn Peptide 
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<400> 9 

Ser Cys Thr Trp Gin Gly Ser Phe Leu Thr Cys Lys 
15 10 



<210> 


10 


<211> 


12 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


10 



Ser Cys Gly Asn Gin Gly Ser Phe Leu Thr Cys Lys 
15 10 



<210> 


11 




<211> 


12 




<212> 


PRT 




<213> 


Artificial Sequence 




<220> 






<223> 


Turn Peptide 




<400> 


11 





Ser Cys Thr Trp Gin Gly Ser Phe Leu Thr Cys Lys 
1 5 10 



<210> 


12 


<211> 


10 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


12 



Cys Thr Lys Val Trp Gin Leu Trp Thr Cys 
15 10 



<210> 


13 


<211> 


12 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


13 



Ser Cys Thr Trp Val Trp Gin Leu Leu Thr Cys Lys 
1 5 10 
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<210> 14 
<211> 12 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Turn Peptide 
<400> 14 

Ser Cys His Phe Gly Pro Leu Thr Trp Val Cys Lys 
15 10 



<210> 


15 


<211> 


12 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


15 



Ser Cys Thr Trp Gly Pro Leu Thr Leu Thr Cys Lys 
15 10 



<210> 16 

<211> 10 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Turn Peptide 
<220> 

<221> MISC_FEATURE 

<222> (3),. (3) 

<223> Xaa is independently Trp, Tyr, Leu, Val, or Thr 



<400> 16 

Cys Thr Xaa Glu Gly Asn Lys Leu Thr Cys 
15 10 



<210> 17 

<211> 10 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Turn Peptide 
<220> 

<221> MISC FEATURE 
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<222> (3).. (3) 

<223> Xaa is independently Trp, Tyr, Leu, Val, or Thr 



<400> 17 



Cys Thr Xaa Glu Asn Gly Lys Leu Thr Cys 
15 10 



<210> 18 

<211> 10 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Turn Peptide 
<220> 

<221> MISC FEATURE 

<222> (3) .7(3) 

<223> Xaa is independently Trp, Tyr, Leu, Val, or Thr 



<220> 

<221> MISC^FEATURE 

<222> (5)., (5) 

<223> Xaa is D-pro 



<400> 18 

Cys Thr Xaa Glu Xaa Asn Lys Leu Thr Cys 
1 5 10 



<210> 19 

<211> 10 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Turn Peptide 
<220> 

<221> MISC_FEATURE 

<222> (3).. (3) 

<223> Xaa is independently Trp, Tyr, Leu, Val, or Thr 



<220> 

< 2 2 1 > MI SC_FEATURE 

<222> (5).. (5) 

<223> Xaa is D-pro 



<400> 19 

Cys Thr Xaa Glu Xaa Gly Lys Leu Thr Cys 
15 10 
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<210> 20 

<211> 10 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Turn Peptide 
<220> 

<221> MISC__FEATURE 

<222> (3).. (3) 

<223> Xaa is independently Trp, Tyr, Phe, Leu, Met, lie, Val, or Ala 



<400> 20 

Cys Thr Xaa Glu Gly Asn Lys Leu Thr Cys 
15 10 



<210> 21 

<211> 10 

<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Turn Peptide 
<220> 

<221> MISC_FEATURE 

<222> (8).. (8) 

<223> Xaa is independently Trp, Tyr, Phe, Leu, Met, lie, Val, or Ala 



<400> 21 

Cys Thr Leu Glu Gly Asn Lys Xaa Thr Cys 
15 10 



<210> 22 

<211> 10 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Turn Peptide 
<220> 

<221> MISC_FEATURE 

<222> (3).. (3) 

<223> Xaa is independently Trp, Tyr, Phe, Leu, Met, lie, Val, or Ala 



<400> 22 

Cys Thr Xaa Glu Gly Asn Lys Trp Thr Cys 
15 10 



7/19 



wo 2004/035735 



PCT/US2003/032450 



<210> 23 

<211> 10 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Turn Peptide 

<220> 

< 2 2 1 > M I SC_FE ATURE 

<222> (8) . . (8) 

<223> Xaa is independently Trp, Tyr, Phe, Leu, Met, lie, Val, or Ala 



<400> 23 

Cys Thr Trp Glu Gly Asn Lys Xaa Thr Cys 
15 10 



<210> 


24 


<211> 


102 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthesized Sequence 


<400> 


24 



taataataaa tggctgatcc gaaccgtttc cgcggtaaag atctgggtgg cggtactcca 60 
aacgacccgc caaccactcc accaactgat agcccaggcg gt 102 



<210> 25 

<211> 72 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
<220> 

<22l> misc_feature 

<222> (19) . . (20) 

<223> Unknown Base 



<220> 

<22l> misc_feature 

<222> (31) . . (32) 

<223> Unknown Base 



<220> 

<221> (nisc_f eature 

<222> (34) . . (35) 

<223> Unknown Base 
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<220> 

<221> misc_feature 

<222> (37).. (38) 

<223> Un)cnown Base 



<220> 

<221> misc_feature 

<222> (40) . . (41) 

<223> Unknown Base 



<220> 

<221> misc_feature 

<222> (52).. (53) 

<223> Unknown Base 



<400> 25 

tccgcctcgg cttatgcann stgcacttgg nnsnnsnnsn nsctgacttg tnnsatggct 60 
gatccgaacc gt 72 

<210> 26 

<211> 10 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Turn Peptide 
<220> 

<221> MISC FEATURE 

<222> (2) .7(2) 

<223> Xaa is independently Trp, Tyr, Phe, His, lie, Val, or Thr 

<400> 26 

Cys Xaa Trp Glu Gly Asn Lys Leu Thr Cys 
15 10 



<210> 27 

<211> 10 

<212> PRT 

<213> Artificial Sequence 
<220> 

<22 3> Turn Peptide 
<220> 

<221> MISC_FEATURE 

<222> (9).. (9) 

<223> Xaa is independently Trp, Tyr, Phe, His, lie, Val, or Thr 
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<400> 27 

Cys Thr Trp Glu Gly Asn Lys Leu Xaa Cys 
15 10 



<210> 


28 


<211> 


10 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


28 



Cys Thr Trp Lys Gly Asn Glu Leu Thr Cys 
15 10 



<210> 


29 


<211> 


10 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


29 



Cys Val Trp Glu Gly Asn Lys Leu His Cys 
15 10 



<210> 


30 


<211> 


10 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


30 



Cys His Trp Glu Gly Asn Lys Leu Val Cys 
1 5 . 10 



<210> 31 

<211> 10 

<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Turn Peptide 
<220> 

<221> MISC_FEATURE 

<222> (3).. (3) 

<223> Xaa is independently Trp, Tyr, Phe, Leu, Met, lie, or Val 
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<400> 31 

Cys His Xaa Glu Gly Asn Lys Leu Val Cys 
15 10 



<210> 32 

<211> 26 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide 

<400> 32 



Thr Pro Cys Val Pro Ala Glu Cys Phe Asp Leu Leu Val Arg His Cys 
15 10 15 



Val Ala Cys Gly Leu Leu Arg Thr Pro Arg 
20 25 



<210> 


33 


<211> 


12 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Peptide 


<400> 


33 


Cys His Trp Asp Leu Leu Val 


1 


5 


<210> 


34 


<211> 


4 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 





10 



<223> Turn Peptide 
<400> 34 



Glu Gly Asn Lys 
1 



<210> 35 

<211> 4 

<212> PRT 

<213> Artificial Sec[uence 
<220> 
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<223> 


Turn Peptide 


<400> 


35 


Glu Asn Gly Lys 
1 


<210> 
<211> 
<212> 
<213> 


36 
4 

PRT 

Artificial Sequence 


<220> 
<223> 


Turn Peptide 


<400> 


36 


Gin Gly Ser Phe 
1 


<210> 
<211> 
<212> 
<213> 


37 
4 

PRT 

Artificial Sequence 


<220> 
<223> 


Turn Peptide 


<400> 


37 


Val Trp Gin Leu 
1 


<210> 
<211> 

<212> 
<213> 


38 
4 

PRT 

Artificial Sequence 


<220> 
<223> 


Turn Peptide 


<400> 


38 


Gly Pro Leu Thr 
1 


<210> 
<211> 
<212> 
<213> 


39 
16 
PRT 

Artificial Sequence 


<220> 
<223> 


Turn Peptide 


<400> 


39 
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Met Gin He Gly Val Lys Asn Pro Asp Gly Thr He Thr Leu Glu Val 



1 


5 


<210> 


40 


<211> 


16 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<220> 




<221> 


MISC FEATURE 


<222> 


(9) . . (9) 


<223> 


Xaa is Ala or Gly 



<400> 40 

Met Gin He Gly Val Lys Ser Pro Xaa Lys Thr He Thr Leu Lys Val 
15 10 15 



<210> 


41 


<211> 


9 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


41 



Tyr Gin Asn Pro Asp Gly Ser Gin Ala 
1 5 



<210> 


42 


<211> 


10 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


42 



He Thr Ser Asn Ser Asp Gly Thr Trp Thr 
15 10 



<210> 43 

<:211> 10 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Turn Peptide 
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<400> 43 

Tyr lie Thr Asn Ser Asp Gly Thr Trp Thr 
15 10 



<210> 


44 


<211> 


12 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


44 



Arg Gly He Thr Val Asn Gly Lys Thr Tyr Gly Arg 
15 10 



<210> 


45 


<211> 


10 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


45 



He Tyr Ser Asn Pro Asp Gly Thr Trp Thr 
15 10 



<210> 46 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Turn Peptide 
<400> 46 

He Tyr Ser Asn Ser Asp Gly Thr Trp Thr 
15 10 



<210> 47 
<211> 16 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Turn Peptide 
<400> 47 

Lys Lys Tyr Thr Val Ser He Asn Gly Lys Lys He Thr Val Ser He 
15 10 15 
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<210> 


48 


<211> 


16 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


4B 



Gly Glu Trp Thr Tyr Asp Asp Ala Thr Lys Thr Phe Thr Val Thr Glu 
1 -5 10 15 



<210> 49 

<211> 12 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Turn Peptide 
<220> 

<221> MISC_FEATURE 

<222> (6) . . (6) 

<223> Xaa is D-pro or L-Asn 



c220> 

<221> MISC_FEATURE 

<222> (8).. (8) 

<223> Xaa is Orn 



<400> 49 

Arg Tyr Val Glu Val Xaa Gly Xaa Lys lie Leu Gin 
15 10 



<210> 50 

<211> 8 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Turn Peptide 
<220> 

<221> MISC_FEATURE 

<222> (1)..(1) 

<223> Xaa is Boc 



<220> 

<221> MISC_FEATURE 

<222> (4) . . (4) 

<223> Xaa is Aib 
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<220> 

<221> MISC_FEATURE 

<222> (8) . . (8) 

<223> Xaa is AsnMe 



<400> 50 

Xaa Cys Leu Xaa Ala Val Cys Xaa 
1 5 



<210> 51 

<211> 8 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide 

<400> 51 

Ala Cys Ser Pro Gly His Cys Glu 
1 5 



<210> 52 

<211> 6 

<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Turn Peptide 



<220> 

<22.1> MISC_FEATURE 

<222> (2).. (2) 

<223> Xaa is unknown 



<220> 

<221> MISC_PEATURE 

<222> (5) . . (5) 

<223> Xaa is unknown 



<400> 52 

Cys Xaa Pro Gly Xaa Cys 
1 5 



<210> 53 

<211> 17 

<212> PRT 

<213> Artificial Sequence 



<220> 
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<223> Turn Peptide 
<400> 53 

Cys Glu Trp Thr Tyr Asp Asp Ala Thr Lys Thr Phe Thr Val Thr Cys 
15 10 15 



Lys 



<210> 54 
<211> 16 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Peptide 
<400> 54 

Gly Cys Lys Pro Thr Phe Arg Arg Leu Lys Trp Lys Tyr Lys Cys Gly 
15 10 15 



<210> 


55 


<211> 


18 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Cyclized Peptide 


<400> 


55 



Cys Ala Gly Phe Met Arg lie Arg Gly Arg lie His Pro Leu Cys Met 
1 5 10 15 



Arg Arg 



<210> 56 
<211> 10 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Turn Peptide 
<400> 56 

Phe Cys Asn Gin Gly Ser Phe Leu Cys Tyr 
15 10 



<210> 57 
<211> 12 
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<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Turn Peptide 
<400> 57 

Phe Cys Tyr lie Cys Glu Val Glu Asp Gin Cys Tyr 
15 10 



<210> 


58 


<211> 


11 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Turn Peptide 


<400> 


58 



Cys Gly Val Ser Arg Gin Gly Lys Pro Tyr Cys 
15 10 



<210> 59 

<211> 12 

<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Turn Peptide 
<220> 

<221> MISC_FEATURE 

<222> (1) . . (1) 

<223> Xaa is any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (5) , . (8) 

<223> Xaa is any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (12) . . (12) 

<223> Xaa is any amino acid 



<400> 59 

Xaa Cys Thr Trp Xaa Xaa Xaa Xaa Leu Thr Cys Xaa 
15 10 



<210> 60 
<211> 6 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Turn Peptide 

<400> 60 

Asp Leu Leu Val Arg His 
1 5 
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